Abstract

This paper builds upon several recent works, where semigroup proofs of Brascamp-Lieb inequalities
are provided in various settings (Euclidean space, spheres and symmetric groups). Our aim is
twofold. Firstly, we provide a general, unifying framework based on Markov generators, in order
to cover a variety of examples of interest going beyond previous investigations. Secondly, we put
forward the combinatorial reasons for which unexpected exponents occur in these inequalities.

1 Introduction

A celebrated inequality of Brascamp and Lieb [H [TH] asserts that given linear surjective maps between
Euclidean spaces $B_i : H \to H_i$, $i = 1, \dots, m$, and given positive coefficients $(c_i)_{i=1}^m$, the best constant $C$
such that for all non-negative measurable functions $f_i : H_i \to \mathbb{R}$ it holds

$$\int_H \prod_{i=1}^m f_i(B_i x)^{c_i}\,dx \le C \prod_{i=1}^m \Big(\int_{H_i} f_i(y)\,dy\Big)^{c_i}$$

can be computed by requiring the inequality on centered Gaussian functions only (i.e. of the form
$f_i = e^{-Q_i}$ where $Q_i$ is a positive definite quadratic form). This far-reaching extension of Hölder's
inequality found applications in harmonic analysis but also in convex geometry. Indeed, a particular
case called the geometric Brascamp-Lieb inequality, put forward by Ball [2] when $\dim(H_i) = 1$, leads
to many precise volume estimates. The general geometric version corresponds to the case when for all
$i = 1, \dots, m$, $B_i B_i^* = \mathrm{Id}_{H_i}$ and $\sum_i c_i B_i^* B_i = \mathrm{Id}_H$, where $B_i^*$ is the adjoint of $B_i$. Under these hypotheses
the optimal constant in the Brascamp-Lieb inequality is $C = 1$. More concretely: let $E_1, \dots, E_m$ be vector
subspaces of $\mathbb{R}^n$ with its canonical Euclidean structure. Denoting by $P_{E_i}$ the orthogonal projection onto
$E_i$, if $\sum_i c_i P_{E_i} = \mathrm{Id}_{\mathbb{R}^n}$ then for all measurable functions $f_i : E_i \to \mathbb{R}^+$ it holds

$$\int_{\mathbb{R}^n} \prod_{i=1}^m f_i(P_{E_i} x)^{c_i}\,dx \le \prod_{i=1}^m \Big(\int_{E_i} f_i\Big)^{c_i}.$$

There exist by now many different proofs of the Brascamp-Lieb theorem: symmetrization when
$\dim(H_i) = 1$ [8], study of Gaussian kernels [15], optimal transport [5]. Heat flow derivations were
presented in the recent works [11] for $\dim(H_i) = 1$ and [7] in general: the geometric Brascamp-Lieb
inequality is established by interpolating between the left and right hand sides of the inequality, thanks
to the heat semigroup. The case when optimal Gaussian functions exist follows from the geometric case
by a clever change of variables and turns out to be generic (the non-trivial remaining cases are in a sense
"boundary" cases and can be decomposed into simpler ones). So the geometric case is also essential
from a theoretical viewpoint. The heat flow proofs required a more precise study of the structure of
the problem, since the finiteness of the constant and the existence of Gaussian maximizers have to be
treated beforehand. They lead to a complete treatment of the equality cases [11], [7], [50]. They were
also flexible enough to adapt to other ambient spaces, as observed by Carlen, Lieb and Loss [11] who
discovered the following Young type inequality on the Euclidean sphere $S^{n-1}$: for all measurable functions
$f_i : [-1,1] \to \mathbb{R}^+$, it holds

$$\int_{S^{n-1}} \prod_{i=1}^n f_i(x_i)\,d\sigma(x) \le \prod_{i=1}^n \Big(\int_{S^{n-1}} f_i(x_i)^2\,d\sigma(x)\Big)^{1/2}, \tag{1}$$

where $\sigma$ is the uniform probability measure on $S^{n-1}$. This inequality can be understood as a correlation
inequality: the coordinates of a uniform random vector on the sphere are not independent, so there is no
Fubini equality. Instead, inequality (1) holds and is a lot better than Hölder's inequality, which would
involve $L^n$-norms of the functions. In a sense, the exponent 2, which turns out to be optimal, shows that
the coordinate functions are not too far from being independent. The above inequality was extended to
a spherical version of the geometric Brascamp-Lieb inequality in [5]. Carlen, Lieb and Loss also proved
a similar inequality for the set of permutations of a finite set and coordinate functions [12].

In this paper we provide a general framework based on Markov generators which allows us to unify
the existing results, derive extensions and clarify the conditions which are required to prove correlation
inequalities. Decompositions of the identity as in (10) play an important role. In the case of functions
depending on blocks of coordinates, we put forward a general set of conditions, which is similar to the
hypotheses of Finner's theorem for product probability spaces [15], but applies to particular non-product
spaces. See e.g. Propositions 10, 20 and Section 4.2.4.

The structure of the exposition is as follows. The abstract framework is described in Section 2 where
a general condition is stated. The next sections provide concrete illustrations of Proposition 1. Section 3
deals with the case where our Markov generator is a diffusion, as it is the case in some classical geometric 
and probabilistic situations. In particular, we shall put forward the algebraic content of our condition in 
the case of Riemannian Lie groups (with emphasis on the orthogonal group SO(n)) and their quotients. 
We study discrete models and their combinatorics in Section 4 and the case where the generator is a sum
of squares in Section 5. The final section is devoted to related (dual, more precisely) entropy inequalities
for the marginals of a probability distribution. We state there an abstract superadditive inequality for the 
associated Fisher information, that leads to a somewhat different route to Brascamp-Lieb inequalities. 

2 The abstract argument: commuting maps and BL-condition 

The basic input is a measurable space $E$ and a Markov semigroup $(P_t)_{t \ge 0}$ acting on functions on $E$, with
generator $L$. We do not discuss here the various questions related to the underlying domain of $L$ and its
associated carré du champ operator $\Gamma$ (see below) as well as the classes of functions under consideration.
When a given inequality on functions is stated, it is always understood relative to the suitable domains
of $(P_t)_{t \ge 0}$, $L$ or $\Gamma$. These are clear in all the continuous or discrete illustrations in this work. We refer
to [1] for an introduction and further details in this respect and to [13] for the discrete setting.

The general framework of our study is the following. We introduce $m \ge 1$ measurable spaces $E_i$ and
maps $T_i : E \to E_i$, $i = 1, \dots, m$. We assume that, for each $i = 1, \dots, m$, the map $T_i$ commutes with
$P_t$ or $L$ in the sense that for every $g : E_i \to \mathbb{R}$, $L(g \circ T_i)$ factors through $T_i$:

$$L(g \circ T_i) = \tilde g \circ T_i \tag{2}$$

for some $\tilde g : E_i \to \mathbb{R}$. In other words, $L$ (or $P_t$) leaves invariant the algebra of functions on $E$ of the form
$g \circ T_i$. This means that $P_t$ or $L$ may be projected on $E_i$ and there exists a Markov generator $L_i$ on $E_i$
such that

$$L(g \circ T_i) = (L_i g) \circ T_i.$$

We denote below by $(P_t^i)_{t \ge 0}$ the semigroup with generator $L_i$. It follows that $P_t(g \circ T_i) = (P_t^i g) \circ T_i$.

We aim at understanding how the "geometry" or the "combinatorics" of the $T_i$'s and an appropriate
choice of constants $c_i > 0$ ensure that

$$P_t\Big(\prod_{i=1}^m f_i^{c_i} \circ T_i\Big) \le \prod_{i=1}^m \big(P_t(f_i \circ T_i)\big)^{c_i}$$

for all $f_i : E_i \to \mathbb{R}^+$, $i = 1, \dots, m$. Since $(P_t(F^{1/c}))^c \le (P_t(F^{1/d}))^d$ for $c \ge d > 0$, we would like to pick
the largest possible constants $c_i$. Also, for obvious reasons (pick all the $f_i$ but one to be identically 1),
the $c_i$'s will belong to $(0, 1]$ and the inequalities we consider can be rewritten in terms of $L^{p_i}$-norms for
$p_i = 1/c_i$.
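
The rewriting in terms of $L^{p_i}$-norms can be made explicit as follows. This is only a sketch: it assumes $p_i = 1/c_i$ and substitutes $f_i = g_i^{p_i}$, where the functions $g_i$ are notation introduced here for illustration.

```latex
% Assumption: p_i = 1/c_i >= 1 and f_i = g_i^{p_i}, with g_i >= 0 defined on E_i.
\[
  P_t\Big(\prod_{i=1}^m g_i\circ T_i\Big)
  \;\le\; \prod_{i=1}^m \Big(P_t\big(g_i^{p_i}\circ T_i\big)\Big)^{1/p_i},
  \qquad p_i = \frac{1}{c_i}\in[1,\infty).
\]
% For m = 1 this reduces to Jensen's inequality for the probability measure P_t(.)(x).
```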

This problem is of course reminiscent of the Brascamp-Lieb convolution inequalities described in the 
introduction, and it can as well be interpreted as a correlation problem. This correlation problem has 
many ramifications, as we shall see. 

We will, in this general framework, be dealing with inequalities which are valid for the measures
$P_t(\cdot)(x)$, uniformly in the point $x$. The following main equivalence is implicit in [11], [7].

Definition-Proposition 1. Let $c_i$ be non-negative reals and $T_i : E \to E_i$ maps commuting with $L$, for
$i = 1, \dots, m$. We say that $\{c_i, T_i\}$ satisfy the BL-condition if: for all functions $F_i : E \to \mathbb{R}$, $i = 1, \dots, m$, of
the form $F_i = g_i \circ T_i$, setting $H = \sum_{i=1}^m c_i F_i$, it holds

$$e^{-H} L(e^{H}) \le \sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i}). \tag{3}$$

Then, the following are equivalent:

• For all non-negative functions $f_i : E_i \to \mathbb{R}$, $i = 1, \dots, m$, and every $t \ge 0$,

$$P_t\Big(\prod_{i=1}^m f_i^{c_i} \circ T_i\Big) \le \prod_{i=1}^m \big(P_t(f_i \circ T_i)\big)^{c_i}. \tag{4}$$

• The $\{c_i, T_i\}$ satisfy the BL-condition.

Proof. Let $f_i : E_i \to \mathbb{R}$, $i = 1, \dots, m$, be bounded positive functions. Let $t > 0$ and consider

$$\alpha(s) = P_s\Big(\exp\Big(\sum_{i=1}^m c_i \log P_{t-s}(f_i \circ T_i)\Big)\Big), \qquad 0 \le s \le t.$$

Set $F_i = \log P_{t-s}(f_i \circ T_i)$, $i = 1, \dots, m$, and $H = \sum_{i=1}^m c_i F_i$. Direct calculations give

$$\alpha'(s) = P_s\Big(L(e^{H}) - e^{H} \sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i})\Big).$$

Next, by the commutation property (2), $F_i = \log P_{t-s}(f_i \circ T_i)$ is a function of $T_i$ so that, under (3),
$\alpha'(s) \le 0$ and thus $\alpha(0) \ge \alpha(t)$. Hence (4) follows from (3). The converse implication is obtained by
differentiating (4) at $t = 0$. □
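
The "direct calculations" behind the formula for $\alpha'(s)$ can be spelled out as below. This is a sketch under the standing domain assumptions; the shorthand $u_i$ is introduced here and is not part of the original notation.

```latex
% Shorthand (ours): u_i(s) = P_{t-s}(f_i \circ T_i), so that F_i = \log u_i and H = \sum_i c_i F_i.
\begin{align*}
  \partial_s u_i &= -L u_i
    \quad\Longrightarrow\quad
    \partial_s F_i = -\frac{L u_i}{u_i} = -e^{-F_i} L(e^{F_i}), \\
  \partial_s e^{H} &= e^{H}\sum_{i=1}^m c_i\,\partial_s F_i
    = -e^{H}\sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i}), \\
  \alpha'(s) &= P_s\big(L(e^{H})\big) + P_s\big(\partial_s e^{H}\big)
    = P_s\Big(L(e^{H}) - e^{H}\sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i})\Big).
\end{align*}
```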
Remark 2. Given maps $T_i : E \to E_i$, $i = 1, \dots, m$, one may not always be able to check the BL-condition
(3). It might be necessary to consider further bijective maps $R : E \to E$, $R_i : E_i \to E_i$ and to
deal with $\tilde T_i = R_i \circ T_i \circ R : E \to E_i$, $i = 1, \dots, m$ (still assumed to commute with $P_t$), instead of $T_i$. This
is exemplified by the paper [7] where the Gaussian-extremizable cases of the Euclidean Brascamp-Lieb
inequality are reduced to the geometric Brascamp-Lieb inequality. Actually this change of variables is
also implicit in [11] where the functions $f_i$ are evolving according to different semigroups.

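It is usually of more interest to state Brascamp-Lieb type inequalities with respect to the invariant
measure $\mu$ of the semigroup $(P_t)_{t \ge 0}$. When $(P_t)_{t \ge 0}$ is ergodic with invariant probability measure $\mu$, we
may let $t \to \infty$ in the local inequality (4) and get inequalities of the type

$$\int \prod_{i=1}^m f_i^{c_i} \circ T_i\, d\mu \le \prod_{i=1}^m \Big(\int f_i \circ T_i\, d\mu\Big)^{c_i}. \tag{5}$$

Actually this can be viewed directly by studying $\beta(t) = \int \prod_i P_t(f_i \circ T_i)^{c_i}\, d\mu$. Indeed with the notation
of the above proof (now with $F_i = \log P_t(f_i \circ T_i)$ and $H = \sum_i c_i F_i$), and since $\int L(e^H)\, d\mu = 0$ by invariance of $\mu$,

$$\beta'(t) = \int e^{H} \Big(\sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i})\Big) d\mu = -\int \Big(L(e^{H}) - e^{H} \sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i})\Big) d\mu.$$

Hence integrating from $0$ to $\infty$, the BL-condition (3) yields (5). Note that the condition $\beta'(t) \ge 0$ may
be rewritten in terms of the Dirichlet form $\mathcal{E}(f,g) := \int f(-Lg)\, d\mu$ as

$$\sum_{i=1}^m c_i\, \mathcal{E}\big(e^{H - F_i}, e^{F_i}\big) \le 0.$$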

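Remark 3. If $(P_t)_{t \ge 0}$ has an infinite invariant measure $\mu$, more hypotheses are needed to get a meaningful
limit to the local bounds as $t \to \infty$. Assume that $(P_t)_{t \ge 0}$ is of dimension $n$ and size $\kappa > 0$, in the sense
that for every $\mu$-integrable function $f : E \to \mathbb{R}$, at any point,

$$\lim_{t \to \infty} t^{n/2} P_t f = \kappa \int f\, d\mu.$$

If the semigroups $(P_t^i)_{t \ge 0}$ have invariant measures $\mu_i$, dimensions $n_i$ and sizes $\kappa_i$, $i = 1, \dots, m$, and if in
addition $\sum_{i=1}^m c_i n_i = n$, we may use $P_t(f_i \circ T_i) = P_t^i(f_i) \circ T_i$ and let $t \to \infty$ in (4), after multiplying both
sides by $t^{n/2} = \prod_i (t^{n_i/2})^{c_i}$, to get

$$\kappa \int \prod_{i=1}^m f_i^{c_i} \circ T_i\, d\mu \le \prod_{i=1}^m \Big(\kappa_i \int f_i\, d\mu_i\Big)^{c_i}.$$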

3 Examples of diffusion semigroups 

This section is devoted to several examples illustrating the preceding abstract scheme in the case where the
generator $L$ satisfies a chain rule formula. Recall that the carré du champ of the generator $L$ is defined
on some suitable algebra of functions by

$$\Gamma(f,g) = \frac{1}{2}\big(L(fg) - f\,Lg - g\,Lf\big). \tag{6}$$

For simplicity one writes $\Gamma(f)$ for $\Gamma(f,f)$. If $L$ is a diffusion generator (i.e. a linear differential operator of
order 2 without constant term), then the chain rule yields $L(e^{f}) = e^{f}(Lf + \Gamma(f))$. So for $H = \sum_{i=1}^m c_i F_i$,

$$e^{-H} L(e^{H}) - \sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i}) = \Gamma(H) - \sum_{i=1}^m c_i\, \Gamma(F_i).$$
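
For the reader's convenience, here is how the last identity follows from the chain rule. The sketch assumes the diffusion property in its standard form, $L\varphi(f) = \varphi'(f)\,Lf + \varphi''(f)\,\Gamma(f)$ for smooth $\varphi$, applied with $\varphi = \exp$.

```latex
% Assumed chain rule for a diffusion generator: L\varphi(f) = \varphi'(f) Lf + \varphi''(f) \Gamma(f).
\begin{align*}
  e^{-f} L(e^{f}) &= Lf + \Gamma(f), \\
  e^{-H} L(e^{H}) - \sum_{i=1}^m c_i\, e^{-F_i} L(e^{F_i})
    &= \Big(LH - \sum_{i=1}^m c_i\, LF_i\Big) + \Gamma(H) - \sum_{i=1}^m c_i\, \Gamma(F_i) \\
    &= \Gamma(H) - \sum_{i=1}^m c_i\, \Gamma(F_i),
\end{align*}
% the first bracket vanishing because L is linear and H = \sum_i c_i F_i.
```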

Hence, we have:

Fact 4 (BL-condition in the diffusion case). If $L$ is a diffusion operator, then the BL-condition (3) is
equivalent to saying that for all functions $f_i : E_i \to \mathbb{R}$, $i = 1, \dots, m$,

$$\Gamma\Big(\sum_{i=1}^m c_i\, f_i \circ T_i\Big) = \sum_{i,j=1}^m c_i c_j\, \Gamma(f_i \circ T_i, f_j \circ T_j) \le \sum_{i=1}^m c_i\, \Gamma(f_i \circ T_i). \tag{7}$$

Depending on the structure, this condition may be expressed more intrinsically in terms of the
operators $T_i$. We investigate several instances below.

3.1 Riemannian manifolds 

Let us assume that $E$ is a Riemannian manifold and that $\Gamma(f) = |\nabla f|^2$. This is in particular the case if
$P_t$ is the heat semigroup on $E$ associated to the Riemannian Laplacian $\Delta$. We also assume that the maps
$T_i$ are differentiable. Then condition (7) amounts to the fact that for every $x \in E$, and for all smooth
functions $f_i$,

$$\Big|\sum_{i=1}^m c_i \nabla(f_i \circ T_i)(x)\Big|^2 \le \sum_{i=1}^m c_i \big|\nabla(f_i \circ T_i)(x)\big|^2. \tag{8}$$

For each $x \in E$, we introduce the subspace of $T_x E$, the tangent space at $x$,

$$\mathcal{E}_i(x) := \{\nabla(f_i \circ T_i)(x);\ f_i : E_i \to \mathbb{R}\} \subset T_x E. \tag{9}$$

This is the orthogonal of the kernel of $DT_i(x)$, so it is orthogonal to the tangent directions of the level set
$\{y \in E;\ T_i(y) = T_i(x)\}$. We denote by $P_{\mathcal{E}_i(x)}$ the orthogonal projection on $\mathcal{E}_i(x)$ in the Euclidean space
$T_x E$. We can reformulate (8) using the following well known equivalence, which relies on the fact that
a linear map and its adjoint have the same norm: for $\mathcal{E}$ a Euclidean space, $\mathcal{E}_i$, $i = 1, \dots, m$, Euclidean
subspaces of $\mathcal{E}$ and $c_1, \dots, c_m \ge 0$ we have:

$$\forall v_i \in \mathcal{E}_i,\ \Big|\sum_{i=1}^m c_i v_i\Big|^2 \le \sum_{i=1}^m c_i |v_i|^2 \quad\Longleftrightarrow\quad \forall v \in \mathcal{E},\ \sum_{i=1}^m c_i |P_{\mathcal{E}_i} v|^2 \le |v|^2,$$

writing $P_{\mathcal{E}_i}$ for the orthogonal projection onto $\mathcal{E}_i$. More concisely, denoting the identity map by $\mathrm{Id}_{\mathcal{E}}$, the
latter condition rewrites as an inequality between symmetric maps: $\sum_{i=1}^m c_i P_{\mathcal{E}_i} \le \mathrm{Id}_{\mathcal{E}}$.

Therefore, we see that the BL-condition amounts here to a "moving decomposition of the identity"
inequality in all tangent spaces.
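
The equivalence used above can be checked along the following lines. This is a sketch, not necessarily the argument the authors have in mind; the operator $S$ and the vectors $w_i$ are notation introduced here.

```latex
% Notation (ours): S maps E_1 x ... x E_m (script E's, product Euclidean norm) into the ambient space.
\begin{align*}
  S(w_1,\dots,w_m) &= \sum_{i=1}^m \sqrt{c_i}\, w_i,
  \qquad
  S^{*} v = \big(\sqrt{c_i}\, P_{\mathcal{E}_i} v\big)_{i=1}^m, \\
  \|S\|\le 1 &\iff \Big|\sum_{i=1}^m c_i v_i\Big|^2 \le \sum_{i=1}^m c_i |v_i|^2
      \quad \forall\, v_i\in\mathcal{E}_i
      \qquad (\text{take } w_i=\sqrt{c_i}\,v_i), \\
  \|S^{*}\|\le 1 &\iff \sum_{i=1}^m c_i \big|P_{\mathcal{E}_i} v\big|^2 \le |v|^2
      \quad \forall\, v\in\mathcal{E}.
\end{align*}
% Since \|S\| = \|S^*\|, the two conditions are equivalent.
```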

Fact 5 (BL-condition in the Riemannian case). In the setting described above, the BL-condition (3) is
equivalent to saying that for all $x \in E$,

$$\sum_{i=1}^m c_i P_{\mathcal{E}_i(x)} \le \mathrm{Id}_{T_x E}. \tag{10}$$

Next, we present instances of such decompositions in the case of model spaces.

Geometric Brascamp-Lieb inequality in Euclidean space. In $\mathbb{R}^n$, let, for $i = 1, \dots, m$, $E_i$ be vector
subspaces of dimension $n_i \ge 1$ and let $c_i > 0$ be such that

$$\sum_{i=1}^m c_i P_{E_i} = \mathrm{Id}_{\mathbb{R}^n}.$$

We take of course $T_i : \mathbb{R}^n \to E_i$ such that $T_i(x) = P_{E_i} x$, $x \in \mathbb{R}^n$, $i = 1, \dots, m$.

If $B$ is a linear map, $\nabla(f \circ B)(x) = {}^tB\, \nabla f(Bx)$ and $\Delta(f \circ B)(x) = \mathrm{Tr}\big({}^tB\, \mathrm{Hess} f(Bx)\, B\big)$. It is then
clear that the generator $L = \Delta - x \cdot \nabla$ of the Ornstein-Uhlenbeck semigroup commutes with the $T_i$'s.
Also for all $x \in \mathbb{R}^n$, the spaces $\mathcal{E}_i(x)$ are simply $E_i$. Hence (10) is guaranteed by the decomposition of
the identity induced by the $E_i$'s. Thus, we get a Brascamp-Lieb inequality for the standard Gaussian
measure, which is the invariant measure of the (ergodic) Ornstein-Uhlenbeck semigroup:

$$\int_{\mathbb{R}^n} \prod_{i=1}^m f_i(P_{E_i} x)^{c_i}\, e^{-|x|^2/2}\, \frac{dx}{(2\pi)^{n/2}} \le \prod_{i=1}^m \Big(\int_{E_i} f_i(y)\, e^{-|y|^2/2}\, \frac{dy}{(2\pi)^{n_i/2}}\Big)^{c_i}.$$

Note that the decomposition of the identity rewrites as $\sum_{i=1}^m c_i |P_{E_i} x|^2 = |x|^2$, hence setting $g_i(y) =
f_i(y) \exp(-|y|^2/2)$ and using the condition $n = \sum_i c_i n_i$ (take traces in the decomposition of the identity),
we obtain the Euclidean inequality

$$\int_{\mathbb{R}^n} \prod_{i=1}^m g_i(P_{E_i} x)^{c_i}\, dx \le \prod_{i=1}^m \Big(\int_{E_i} g_i(y)\, dy\Big)^{c_i}.$$

Alternatively we could have used the heat semigroup (with generator $\Delta$) to get a local inequality and
pass to the limit using the dimension of this semigroup, as explained in Remark 3.
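
The passage from the Gaussian inequality to the Euclidean one can be written out as follows. This is a sketch of the computation under the stated hypotheses (decomposition of the identity and its trace), using the substitution $f_i(y) = g_i(y)\,e^{|y|^2/2}$.

```latex
% Left-hand side: the Gaussian weight is absorbed thanks to \sum_i c_i |P_{E_i} x|^2 = |x|^2.
\begin{align*}
  \prod_{i=1}^m f_i(P_{E_i}x)^{c_i}\, e^{-|x|^2/2}
    &= \prod_{i=1}^m g_i(P_{E_i}x)^{c_i}\,
       \exp\Big(\tfrac12\sum_{i=1}^m c_i |P_{E_i}x|^2 - \tfrac12 |x|^2\Big)
     = \prod_{i=1}^m g_i(P_{E_i}x)^{c_i}, \\
% Normalising constants: they agree on both sides since \sum_i c_i n_i = n.
  \prod_{i=1}^m \big((2\pi)^{n_i/2}\big)^{c_i}
    &= (2\pi)^{\frac12\sum_{i} c_i n_i} = (2\pi)^{n/2}.
\end{align*}
```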

Geometric Brascamp-Lieb inequality on the sphere. The first inequality of this type was established
by Carlen, Lieb and Loss [11] for coordinate functions on the sphere. It involves an unexpected exponent
2. A natural extension in the spirit of the latter Euclidean inequality was given in [5]. It reads as follows:
if $x \in S^{n-1} \subset \mathbb{R}^n$ (the standard $(n-1)$-sphere), set as before $T_i(x) = P_{E_i}(x)$, $i = 1, \dots, m$, where
$E_i \subset \mathbb{R}^n$ are subspaces for which we have

$$\sum_{i=1}^m c_i P_{E_i} \le \mathrm{Id}_{\mathbb{R}^n}.$$

Then, whenever $f_i$ are non-negative measurable functions on the sphere, such that $f_i$ depends only on
$E_i$ (that is $f_i(x) = g_i(P_{E_i}(x))$), for the uniform probability measure $\sigma$ on $S^{n-1}$ we have

$$\int_{S^{n-1}} \prod_{i=1}^m f_i^{c_i/2}\, d\sigma \le \prod_{i=1}^m \Big(\int_{S^{n-1}} f_i\, d\sigma\Big)^{c_i/2}.$$

It is easy to see that the Laplacian on $S^{n-1}$ commutes with the operators $T_i$. The strategy in [5] is to derive
decompositions of the identity in all tangent hyperplanes to the sphere, thus fulfilling Condition (10).
Another approach based on analysis on the orthogonal group will be given next.

Hyperbolic space. It is natural to ask for a hyperbolic analogue of the previous statement. Let us
explain, in two dimensions, why the method does not give any interesting correlation inequality. The
natural functionals $T_i$ to consider are the Busemann functions (which basically are the coordinates in the
direction of a point at infinity); they commute with the Laplace operator. In the disk model, choose
$b_1, \dots, b_m$ on the unit circle and let $T_i$ be the corresponding Busemann functions. At a point $x$ in the
disk the directions $\mathcal{E}_i(x)$ are simply the lines spanned by the gradients of the $T_i$'s (the tangent to the
geodesic passing through $x$ and going to $b_i$). When $x$ tends to a point at infinity $b$ which is not one of the
$b_i$'s, it is clear that the lines $\mathcal{E}_i(x)$ become asymptotically parallel to the line $\mathbb{R}b$. Hence if a decomposition
of the identity exists in all tangent planes we get that $\sum_i c_i \le 1$. But in this case the decomposition (10)
is trivial since $P_{\mathcal{E}_i(x)} \le \mathrm{Id}$, and the inequality that we get is nothing else than Hölder's inequality.

3.2 Riemannian Lie groups 

In the case of Lie groups (and their quotients), the geometric structure required to have Brascamp-Lieb 
type inequalities is very clear and elegant. 

The algebraic structure of the problem appears clearly when functions depending only on some
variables are seen as functions invariant under the (right) action of subgroups of isometries. For instance,
a function $f(x)$ on $\mathbb{R}^n$ is a function of $x_1$ if and only if $f$ is invariant under all translations by vectors
orthogonal to $e_1 = (1, 0, \dots, 0)$. Note also that a function $f(x)$ on the sphere $S^{n-1} \subset \mathbb{R}^n$ is a function of $x_1$ if and
only if $f$ is invariant under all rotations leaving $e_1$ invariant. In this section, we shall extensively use this
point of view in the case of compact Riemannian Lie groups.

Let $G$ be a connected compact Riemannian Lie group with unit element denoted by $e$. Let $\mathfrak{g} = T_e G$
be the associated Lie algebra; by assumption, $\mathfrak{g}$ is a Euclidean space. Let $\mu$ be the normalized bi-invariant
Haar measure on $G$. Here we will work with the Laplace-Beltrami operator $\Delta$ as Markov generator, for
which we indeed have that

$$\Gamma(f) = |\nabla f|^2,$$

as required in the previous section.

Let $G_i$ be a connected Lie subgroup of $G$, with Lie algebra $\mathfrak{g}_i \subset \mathfrak{g}$. A function $f : G \to \mathbb{R}$ is said to
be $G_i$-right-invariant if

$$f(xg) = f(x), \qquad \forall g \in G_i,\ \forall x \in G.$$

Equivalently $f$ is of the form $g \circ T_i$ where $T_i : G \to G/G_i$ is the canonical projection onto the right-quotient,
defined by $T_i(x) = xG_i$. In other words, using notation (9), we are interested in the case where,
for $x \in G$,

$$\mathcal{E}_i(x) = \{\nabla f(x);\ f : G \to \mathbb{R} \text{ is } G_i\text{-right-invariant}\}.$$

If $f$ is $G_i$-right-invariant, then for all $v \in \mathfrak{g}_i$ and all $t \in \mathbb{R}$,

$$f(x \exp(tv)) = f(x), \qquad \forall x \in G.$$

If $f$ is differentiable, we get that

$$0 = \frac{d}{dt}\Big|_{t=0} f(x \exp(tv)) = \langle \nabla f(x), d(L_x)_e v \rangle, \qquad \forall v \in \mathfrak{g}_i,$$

where $L_x : G \to G$ is the left-multiplication by $x$. Since $L_x$ is an isometry of $G$, its differential at $e$,
$d(L_x)_e$, is an isometry between the Euclidean spaces $T_e G = \mathfrak{g}$ and $T_x G$. In particular, we will exploit
the invariance property in the following form

$$(d(L_x)_e)^{-1} \nabla f(x) \in \mathfrak{g}_i^{\perp}, \qquad \forall x \in G. \tag{11}$$

Roughly speaking, a $G_i$-right-invariant function $f$ "depends" only on $\mathfrak{g}_i^{\perp}$ in the sense that the gradient
$\nabla f(x)$ is in the direction $\mathfrak{g}_i^{\perp}$ transported on $T_x G$:

$$\mathcal{E}_i(x) = d(L_x)_e\, \mathcal{E}_i,$$

setting $\mathcal{E}_i := \mathfrak{g}_i^{\perp}$. With this formalism, the condition to have a Brascamp-Lieb inequality boils down to
the existence of a decomposition of the identity in the Lie algebra:

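Theorem 6. Let $G$ be a connected compact Riemannian Lie group. Let $(G_i)_{i=1}^m$ be connected Lie subgroups
and let $\mathcal{E}_i := \mathfrak{g}_i^{\perp}$ be the orthogonal complements in the Lie algebra $\mathfrak{g}$ of $G$ of their Lie algebras
$(\mathfrak{g}_i)_{i=1}^m$. Assume that for given $d_1, \dots, d_m \ge 0$ the following inequality holds between symmetric linear
maps of $\mathfrak{g}$:

$$\sum_{i=1}^m d_i P_{\mathcal{E}_i} \le \mathrm{Id}_{\mathfrak{g}}. \tag{12}$$

Then the BL-condition (3) is satisfied. In particular, if for $i = 1, \dots, m$, $f_i : G \to \mathbb{R}^+$ is $G_i$-right-invariant,
it holds

$$\int_G \prod_{i=1}^m f_i^{d_i}\, d\mu \le \prod_{i=1}^m \Big(\int_G f_i\, d\mu\Big)^{d_i}. \tag{13}$$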

Proof. We consider the heat semigroup on $G$. The Laplace-Beltrami operator commutes with right multiplication
by the elements of the group so that the commutation relation is verified; in particular $P_t f_i$ is
again $G_i$-invariant. Next let us check the BL-condition in the form (8) put forward at the beginning of the
Riemannian case. If for $i \le m$, $h_i$ is a differentiable $G_i$-invariant function, then, rewriting (11) as

$$d(L_{x^{-1}})_x \nabla h_i(x) \in \mathcal{E}_i,$$

we get $P_{\mathcal{E}_i}\, d(L_{x^{-1}})_x \nabla h_i(x) = d(L_{x^{-1}})_x \nabla h_i(x)$. Using the fact that $d(L_{x^{-1}})_x$ is an isometry between
$T_x G$ and $\mathfrak{g}$ and the decomposition of the identity in $\mathfrak{g}$, we see that

$$\Big\|\sum_i d_i \nabla h_i(x)\Big\|^2 = \Big\|d(L_{x^{-1}})_x \Big(\sum_i d_i \nabla h_i(x)\Big)\Big\|^2 = \Big\|\sum_i d_i\, d(L_{x^{-1}})_x \nabla h_i(x)\Big\|^2
\le \sum_i d_i \big\|d(L_{x^{-1}})_x \nabla h_i(x)\big\|^2 = \sum_i d_i \|\nabla h_i(x)\|^2.$$

The result follows. Equivalently, we could have said that the isometry $dL_x$ pushes forward the decomposition
(12) from $\mathfrak{g} = T_e G$ to the decomposition (10) on $T_x G$. □

3.2.1 Calculations in SO(n)

We consider subgroups related to the natural action of $SO(n)$ on $\mathbb{R}^n$ and study the relationship between
decompositions of the identity of $\mathbb{R}^n$ and the ones induced on $\mathcal{A}_n = \mathfrak{so}(n)$, the set of antisymmetric
$n \times n$ matrices which is the Lie algebra of $SO(n)$. The Euclidean structure on $\mathcal{A}_n$ is given by the
Hilbert-Schmidt norm and the corresponding scalar product $\langle A, B \rangle = \mathrm{Tr}({}^tA B) = -\mathrm{Tr}(AB)$.

We will consider as before functions on $SO(n)$ which are right-invariant with respect to subgroups.
There exist two natural subgroups associated to a subspace $E \subset \mathbb{R}^n$: $\mathrm{Fix}(E)$ and $\mathrm{Stab}(E)$.

Lemma 7. Let $E$ be a vector subspace of $\mathbb{R}^n$. Consider the group

$$H = \mathrm{Fix}(E) := \{U \in SO(n);\ U|_E = \mathrm{Id}\}$$

and let $\mathcal{H}$ be its Lie algebra. We have $\mathcal{H} = \{A \in \mathcal{A}_n;\ A|_E = 0\}$ and if $P_{\mathcal{E}} : \mathcal{A}_n \to \mathcal{A}_n$ denotes the
orthogonal projection onto $\mathcal{E} := \mathcal{H}^{\perp}$, we have that

$$\|P_{\mathcal{E}}(A)\|^2 = 2\|P_E A\|^2 - \|P_E A P_E\|^2, \qquad \forall A \in \mathcal{A}_n.$$

Moreover, a function $f : G \to \mathbb{R}$ is $H$-right-invariant if and only if $f(U)$ is actually a function of $U|_E$.

Proof. The equality $\mathcal{H} = \{A \in \mathcal{A}_n;\ A|_E = 0\}$ is obvious. Let us check that the orthogonal projection of
$A \in \mathcal{A}_n$ onto $\mathcal{H}$ is $P_{E^\perp} A P_{E^\perp}$. Indeed the latter is clearly antisymmetric and vanishes on vectors of $E$,
so it belongs to $\mathcal{H}$. It remains to check the orthogonality condition: if $B \in \mathcal{H}$,

$$-\langle B, A - P_{E^\perp} A P_{E^\perp} \rangle = \mathrm{Tr}\big(B(A - P_{E^\perp} A P_{E^\perp})\big) = \mathrm{Tr}(BA) - \mathrm{Tr}(B P_{E^\perp} A P_{E^\perp}).$$

Since $B$ vanishes on $E$, $B = B(P_E + P_{E^\perp}) = B P_{E^\perp}$ and taking adjoints $P_{E^\perp} B = B$. It is then clear
that $\mathrm{Tr}(B P_{E^\perp} A P_{E^\perp}) = \mathrm{Tr}(BA)$. The orthogonality follows.

Since $\mathcal{E} = \mathcal{H}^{\perp}$, and writing for short $P$ instead of $P_E$ and $I$ instead of $\mathrm{Id}_{\mathbb{R}^n}$, we have

$$P_{\mathcal{E}}(A) = A - P_{E^\perp} A P_{E^\perp} = A - (I - P)A(I - P) = PA + AP - PAP.$$

Finally, since $P_{\mathcal{E}}$ is a self-adjoint projection,

$$\|P_{\mathcal{E}}(A)\|^2 = \langle A, P_{\mathcal{E}} A \rangle = -\mathrm{Tr}\big(A(PA + AP - PAP)\big)
= -2\,\mathrm{Tr}(A^2 P) + \mathrm{Tr}(APAP) = 2\|PA\|^2 - \|PAP\|^2.$$

The statement on $H$-right-invariant functions is easy. Such a function can be viewed as a function on
$SO(n)/H \approx SO(n)/SO(E^\perp)$ which can be identified with the Stiefel manifold of orthonormal frames of
size $\dim(E)$ in $\mathbb{R}^n$. More explicitly, $U_1 H = U_2 H$ is equivalent to $U_2^{-1} U_1 \in H$, that is for all $x \in E$,
$U_1(x) = U_2(x)$. Hence the restriction of $U$ to $E$ characterizes the class of $U$ in the quotient. □

Lemma 8. Let $E$ be a vector subspace of $\mathbb{R}^n$. Consider the group

$$H = \mathrm{Stab}(E) := \{U \in SO(n);\ U(E) \subset E\}$$

and let $\mathcal{H}$ be its Lie algebra. If $P_{\mathcal{E}} : \mathcal{A}_n \to \mathcal{A}_n$ denotes the orthogonal projection onto $\mathcal{E} := \mathcal{H}^{\perp}$, it holds

$$\|P_{\mathcal{E}}(A)\|^2 = 2\|P_E A\|^2 - 2\|P_E A P_E\|^2, \qquad \forall A \in \mathcal{A}_n.$$

Moreover, a function $f : G \to \mathbb{R}$ is $H$-right-invariant if and only if $f(U)$ is actually a function of $U(E)$.

Proof. The argument is very similar to the one of the previous lemma. First note that

$$H = \{U \in SO(n);\ U(E) = E\} = \{U \in SO(n);\ U(E) \subset E \text{ and } U(E^\perp) \subset E^\perp\}.$$

For an $H$-right-invariant function $f$, $f(U)$ depends only on $UH$. Since $U_1 H = U_2 H$ is equivalent to
$U_1(E) = U_2(E)$, the quantity $f(U)$ depends on $U(E)$. In other words $f$ factors through the Grassmann
manifold of subspaces of dimension $\dim(E)$ in $\mathbb{R}^n$.

One easily checks that $\mathcal{H} = \{A \in \mathcal{A}_n;\ A(E) \subset E \text{ and } A(E^\perp) \subset E^\perp\}$. The orthogonal projection of
$A \in \mathcal{A}_n$ onto $\mathcal{H}$ is $P_E A P_E + P_{E^\perp} A P_{E^\perp}$. Indeed this is clearly an antisymmetric map for which $E$ and
$E^\perp$ are stable. Moreover for $B \in \mathcal{H}$, it is clear that $B = P_E B P_E + P_{E^\perp} B P_{E^\perp}$. Hence

$$-\langle B, A - P_E A P_E - P_{E^\perp} A P_{E^\perp} \rangle = \mathrm{Tr}(BA) - \mathrm{Tr}(B P_E A P_E) - \mathrm{Tr}(B P_{E^\perp} A P_{E^\perp}) = 0.$$

Finally, since $\mathcal{E} = \mathcal{H}^{\perp}$, $P_{\mathcal{E}}(A) = A - P_E A P_E - P_{E^\perp} A P_{E^\perp}$. So calculating as in the previous lemma,
we have $P_{\mathcal{E}}(A) = PA + AP - 2PAP$ and

$$\|P_{\mathcal{E}}(A)\|^2 = \langle A, P_{\mathcal{E}} A \rangle = -\mathrm{Tr}\big(A(PA + AP - 2PAP)\big)
= -2\,\mathrm{Tr}(A^2 P) + 2\,\mathrm{Tr}(APAP) = 2\|PA\|^2 - 2\|PAP\|^2.$$

□
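
The norm identities of Lemmas 7 and 8 are easy to test numerically. The following is a minimal sketch in exact integer arithmetic; it assumes, purely for illustration, that $E$ is the coordinate subspace spanned by the first $k$ basis vectors, and all function names are hypothetical helpers rather than anything taken from the paper.

```python
# Exact integer check of the norm identities in Lemmas 7 and 8.
# Assumption (ours): E = span(e_1, ..., e_k) is a coordinate subspace of R^n.
import random


def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def hs_norm2(X):
    # Squared Hilbert-Schmidt norm: sum of the squared entries.
    return sum(x * x for row in X for x in row)


def inner(X, Y):
    # <X, Y> = Tr(tX Y), i.e. the sum of entrywise products.
    return sum(x * y for rx, ry in zip(X, Y) for x, y in zip(rx, ry))


def combine(terms):
    # Integer linear combination of matrices; terms = [(coeff, matrix), ...].
    n = len(terms[0][1])
    return [[sum(c * M[i][j] for c, M in terms) for j in range(n)]
            for i in range(n)]


def random_antisymmetric(n, rng):
    A = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            A[i][j] = rng.randint(-9, 9)
            A[j][i] = -A[i][j]
    return A


def check(n=5, k=2, seed=0):
    rng = random.Random(seed)
    A = random_antisymmetric(n, rng)
    # P is the orthogonal projection of R^n onto E.
    P = [[1 if i == j and i < k else 0 for j in range(n)] for i in range(n)]
    PA, AP = matmul(P, A), matmul(A, P)
    PAP = matmul(PA, P)
    proj_fix = combine([(1, PA), (1, AP), (-1, PAP)])   # formula of Lemma 7
    proj_stab = combine([(1, PA), (1, AP), (-2, PAP)])  # formula of Lemma 8
    ok_fix = hs_norm2(proj_fix) == 2 * hs_norm2(PA) - hs_norm2(PAP)
    ok_stab = hs_norm2(proj_stab) == 2 * hs_norm2(PA) - 2 * hs_norm2(PAP)
    # A - proj is the Lie algebra component; it must be orthogonal to proj.
    ok_orth = (inner(combine([(1, A), (-1, proj_fix)]), proj_fix) == 0
               and inner(combine([(1, A), (-1, proj_stab)]), proj_stab) == 0)
    return ok_fix, ok_stab, ok_orth
```

Calling `check()` returns three booleans (the identity of Lemma 7, the identity of Lemma 8, and the orthogonality of the two components); restricting to coordinate subspaces loses no generality for the identities themselves, since any subspace is the image of a coordinate one under a rotation.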

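The connection between decompositions of the identity of $\mathbb{R}^n$ and of $\mathcal{A}_n$ is explained next.

Proposition 9. For $i = 1, \dots, m$, let $c_i > 0$, $E_i$ be a vector subspace of $\mathbb{R}^n$ and let $G_i$ be either $\mathrm{Fix}(E_i)$
or $\mathrm{Stab}(E_i)$. Denote by $\mathcal{E}_i = \mathfrak{g}_i^{\perp}$ the orthogonal of $\mathfrak{g}_i$ (the Lie algebra of $G_i$) in $\mathcal{A}_n$. We have

$$\sum_{i=1}^m c_i P_{E_i} \le \mathrm{Id}_{\mathbb{R}^n} \quad\Longrightarrow\quad \sum_{i=1}^m \frac{c_i}{2} P_{\mathcal{E}_i} \le \mathrm{Id}_{\mathcal{A}_n}.$$

As a consequence, if $\sum_{i=1}^m c_i P_{E_i} \le \mathrm{Id}_{\mathbb{R}^n}$, then inequality (13) holds on $G = SO(n)$ (equipped with its
uniform probability measure $\mu$) with $d_i = c_i/2$, whenever each $f_i(U)$ is a function of $U(E_i)$ or of $U|_{E_i}$, $i = 1, \dots, m$.

Proof. By Lemma 7 and Lemma 8, for any $A \in \mathcal{A}_n$, $\|P_{\mathcal{E}_i}(A)\|^2 \le 2\|P_{E_i} A\|^2$. Hence

$$\Big\langle \sum_{i=1}^m \frac{c_i}{2} P_{\mathcal{E}_i} A, A \Big\rangle = \sum_{i=1}^m \frac{c_i}{2} \|P_{\mathcal{E}_i}(A)\|^2 \le \sum_{i=1}^m c_i \|P_{E_i} A\|^2
= \mathrm{Tr}\Big({}^tA \Big(\sum_{i=1}^m c_i P_{E_i}\Big) A\Big) \le \mathrm{Tr}({}^tA A) = \|A\|^2.$$

□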

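Note that we have not used the full strength of Lemmata 7 and 8 since we have discarded the terms
$\|P_{E_i} A P_{E_i}\|^2$. However, in the case where the $E_i$'s are one dimensional subspaces of $\mathbb{R}^n$, these terms
vanish, since in this particular case we have

$$P_{E_i} A P_{E_i} = 0, \qquad \forall A \in \mathcal{A}_n$$

(indeed $\langle u, Au \rangle = 0$ for every vector $u$ when $A$ is antisymmetric). So, if $E_i = \mathbb{R}u_i$ where the $u_i$'s are
norm 1 vectors satisfying the decomposition of the identity

$$\sum_{i=1}^m c_i\, u_i \otimes u_i = \mathrm{Id}_{\mathbb{R}^n},$$

where $u_i \otimes u_i = P_{E_i}$, then we have, with the notation of the Proposition,

$$\sum_{i=1}^m \frac{c_i}{2} P_{\mathcal{E}_i} = \mathrm{Id}_{\mathcal{A}_n}.$$

We do not lose in the passage to the Lie algebra. A particular case of interest is when $m = n$,
$c_1 = \dots = c_n = 1$ and $(u_1, \dots, u_n)$ is an orthonormal basis of $\mathbb{R}^n$.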
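
To make the orthonormal-basis case concrete, one can combine Proposition 9 with Theorem 6 as sketched below. The functions $g_i$ on the sphere are notation introduced here, and the sketch takes $G_i = \mathrm{Fix}(\mathbb{R}e_i)$, $d_i = 1/2$ and $f_i(U) = g_i(Ue_i)^2$.

```latex
% Assumptions (ours): (e_1,...,e_n) orthonormal basis, g_i >= 0 measurable on S^{n-1},
% f_i(U) = g_i(U e_i)^2, which is Fix(R e_i)-right-invariant; d_i = c_i / 2 = 1/2.
\[
  \int_{SO(n)} \prod_{i=1}^n g_i(Ue_i)\, d\mu(U)
  \;\le\; \prod_{i=1}^n \Big(\int_{SO(n)} g_i(Ue_i)^2\, d\mu(U)\Big)^{1/2}
  \;=\; \prod_{i=1}^n \Big(\int_{S^{n-1}} g_i^2\, d\sigma\Big)^{1/2},
\]
% the last equality because each column U e_i of a Haar-distributed U is uniform on the sphere.
```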

For higher dimensional $E_i$'s, it is possible, in some specific situations, to recombine the terms
$\|P_{E_i} A P_{E_i}\|^2$ to recover a multiple of $\|A\|^2$ and to improve the exponents in the correlation inequality.
This is easily seen for coordinate subspaces, i.e. spaces spanned by vectors of the canonical basis
$(e_1, \dots, e_n)$ of $\mathbb{R}^n$ (or of any given orthonormal basis, of course). The following proposition puts forward
a typical set of conditions ensuring that the BL-condition (3) is fulfilled. It will appear later in similar forms.

Proposition 10. Let $\mathcal{I}$ be a collection of subsets of $\{1, \dots, n\}$. Assume that it is written as a disjoint
union $\mathcal{I} = \mathcal{I}_1 \cup \mathcal{I}_2$. For each nonempty subset $I \in \mathcal{I}$, let $c_I > 0$, $E_I := \mathrm{span}(e_i;\ i \in I)$ and $f_I : SO(n) \to
\mathbb{R}^+$ such that

• if $I \in \mathcal{I}_1$ then for all $U$, $f_I(U)$ only depends on $U|_{E_I}$,

• if $I \in \mathcal{I}_2$ then for all $U$, $f_I(U)$ only depends on $U(E_I)$.

If for all $1 \le i, j \le n$ with $i \ne j$ it holds

$$\sum_{I \in \mathcal{I}_1;\ I \cap \{i,j\} \ne \emptyset} c_I \;+ \sum_{I \in \mathcal{I}_2;\ \mathrm{card}(I \cap \{i,j\}) = 1} c_I \;\le\; 1,$$

then the BL-condition (3) is satisfied and in particular

$$\int_{SO(n)} \prod_{I \in \mathcal{I}} f_I^{c_I}\, d\mu \le \prod_{I \in \mathcal{I}} \Big(\int_{SO(n)} f_I\, d\mu\Big)^{c_I}.$$

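Proof. Simply note that for $A = (a_{i,j})_{1 \le i,j \le n} \in \mathcal{A}_n$, $\|P_{E_I} A P_{E_I}\|^2 = \sum_{i,j \in I} a_{i,j}^2$ and

$$\|P_{E_I} A\|^2 = \mathrm{Tr}({}^tA P_{E_I} A) = \mathrm{Tr}\Big(\sum_{i \in I} Ae_i \otimes Ae_i\Big) = \sum_{i \in I} |Ae_i|^2 = \sum_{i \in I} \sum_{j=1}^n a_{i,j}^2.$$

Let us set $\lambda_I := 1$ if $I \in \mathcal{I}_1$, $\lambda_I := 2$ if $I \in \mathcal{I}_2$. Using Lemmata 7 and 8 and the antisymmetry of $A \in \mathcal{A}_n$,
we have

$$\sum_I c_I \|P_{\mathcal{E}_I}(A)\|^2 = \sum_I c_I \big(2\|P_{E_I} A\|^2 - \lambda_I \|P_{E_I} A P_{E_I}\|^2\big)$$
$$= \sum_I c_I \Big(2 \sum_{i \in I} \sum_{j=1}^n a_{i,j}^2 - \lambda_I \sum_{i,j \in I} a_{i,j}^2\Big) = \sum_{i,j=1}^n a_{i,j}^2 \Big(2 \sum_{I;\ i \in I} c_I - \sum_{I;\ i,j \in I} \lambda_I c_I\Big)$$
$$= 2 \sum_{1 \le i < j \le n} a_{i,j}^2 \Big(\sum_{I;\ i \in I} c_I + \sum_{I;\ j \in I} c_I - \sum_{I;\ i,j \in I} \lambda_I c_I\Big).$$

The latter is upper bounded by $\|A\|^2 = 2\sum_{i<j} a_{i,j}^2$ as soon as for all $i \ne j$,

$$\sum_{I;\ i \in I} c_I + \sum_{I;\ j \in I} c_I - \sum_{I;\ i,j \in I} \lambda_I c_I \le 1,$$

which is exactly our hypothesis on the coefficients $(c_I)_{I \in \mathcal{I}}$. Hence $\sum_I c_I P_{\mathcal{E}_I} \le \mathrm{Id}_{\mathcal{A}_n}$ and Theorem 6
yields the claim. □

Let us restate the previous result in the case where we are looking for inequalities involving identical $c_I$'s.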

Proposition 11. Let $\mathcal{I}$ be a family of subsets of $\{1, \dots, n\}$, and consider

$$p := \max_{1 \le i < j \le n} \mathrm{card}\{I \in \mathcal{I};\ I \cap \{i,j\} \ne \emptyset\},$$

$$q := \max_{1 \le i < j \le n} \mathrm{card}\{I \in \mathcal{I};\ \mathrm{card}(I \cap \{i,j\}) = 1\}.$$

Then for all non-negative functions $g_I$, $h_I$ defined on suitable spaces,

$$\int \prod_{I \in \mathcal{I}} g_I(U|_{E_I})\, d\mu(U) \le \prod_{I \in \mathcal{I}} \Big(\int g_I(U|_{E_I})^p\, d\mu(U)\Big)^{1/p},$$

$$\int \prod_{I \in \mathcal{I}} h_I(U(E_I))\, d\mu(U) \le \prod_{I \in \mathcal{I}} \Big(\int h_I(U(E_I))^q\, d\mu(U)\Big)^{1/q}.$$

Let us put forward two particular cases of application of the previous result:

• Blocks of coordinates: if $\mathcal{I}$ is a non-trivial partition of $\{1, \dots, n\}$ then each pair $\{i,j\}$ meets at
most two sets in the family and we get $p = q = 2$.

• Loomis-Whitney inequality: if $\mathcal{I}$ is the family of all subsets of $\{1, \dots, n\}$ of size $k$, then any pair
meets $\binom{n}{k} - \binom{n-2}{k}$ sets. Hence we have

$$p = \binom{n}{k} - \binom{n-2}{k} = \binom{n-1}{k-1} + \binom{n-2}{k-1}.$$

However the number of sets of cardinality $k$ which intersect a given pair in exactly one point is
$\binom{n}{k} - \binom{n-2}{k} - \binom{n-2}{k-2} = 2\binom{n-2}{k-1}$. So we get a smaller exponent $q = 2\binom{n-2}{k-1}$.

It is worth noting that a direct application of Proposition 9 would have given worse estimates (when
$k \ge 2$), in both cases. Indeed, if we denote by $P_I$ the projection onto the subspace spanned by $\{e_i,\ i \in I\}$
for $I \subset \{1, \dots, n\}$, we have

$$\sum_{I;\ \mathrm{card}(I) = k} P_I = \frac{k}{n} \binom{n}{k}\, \mathrm{Id}_{\mathbb{R}^n},$$

and therefore we would get exponents $p$ and $q$ equal to $2\frac{k}{n}\binom{n}{k} = 2\binom{n-1}{k-1}$.
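
The counts in the Loomis-Whitney case can be verified by brute force. The sketch below enumerates the family of all $k$-subsets and compares the resulting $p$ and $q$ of Proposition 11 with the closed forms above and with the exponent $2\binom{n-1}{k-1}$ coming from Proposition 9; the function names are hypothetical and indices run from 0 for convenience.

```python
# Brute-force check of the exponents p and q for the family of all k-subsets.
from itertools import combinations
from math import comb


def exponents(n, k):
    """Return (p, q) as defined in Proposition 11, by direct enumeration."""
    family = list(combinations(range(n), k))
    p = q = 0
    for pair in combinations(range(n), 2):
        hits = [len(set(I) & set(pair)) for I in family]
        p = max(p, sum(1 for h in hits if h >= 1))  # sets meeting the pair
        q = max(q, sum(1 for h in hits if h == 1))  # sets meeting it exactly once
    return p, q


def closed_forms(n, k):
    """Closed forms for p and q, plus the exponent from Proposition 9."""
    p = comb(n, k) - comb(n - 2, k)
    q = 2 * comb(n - 2, k - 1)
    direct = 2 * comb(n - 1, k - 1)
    return p, q, direct


print(exponents(5, 2), closed_forms(5, 2))  # (7, 6) (7, 6, 8)
```

For $n = 5$, $k = 2$ this gives $p = 7$ and $q = 6$, both smaller than the exponent $8$ obtained from Proposition 9.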

Remark 12. One can take advantage of the terms $\|P_E A P_E\|^2$ in more general situations. They have to
be rather symmetric though. Letting $2 \le k \le n - 1$, one instance is given by the family of all the spaces
spanned by any $k$ vertices of a regular simplex in $\mathbb{R}^n$ with center of mass at the origin.



3.2.2 Passing to quotients 

So far, we have taken advantage of right-invariances of the functions $f_i$. Plainly, similar results hold if all
the functions are left-invariant instead. It would be very interesting to get better inequalities when the
functions $f_i$ enjoy left and right invariances together (this would encompass functions on $SO(n)$ depending
on matrices $U$ only through submatrices). Unfortunately, our approach does not give interesting general
results in this direction (nothing better than what one gets by applying first Hölder's inequality in order
to get two integrals; each of these integrals is then upper-bounded by using only one-sided invariance).
In the specific case when the functions have different right-invariances and a common left invariance,
our results can be stated instead on the left-quotient. This is a way to get inequalities for homogeneous
spaces corresponding to a compact Riemannian Lie group.

Let us illustrate this remark for the sphere: if $E_i$ is a subspace of $\mathbb{R}^n$ and $f_i : S^{n-1} \to \mathbb{R}^+$ is of the form
$f_i(x) = g_i(P_{E_i} x)$, we may introduce $F_i : SO(n) \to \mathbb{R}^+$ defined by $F_i(U) = g_i(P_{E_i}\, {}^tU e_1) = g_i({}^t(U P_{E_i}) e_1)$.
Then $F_i$ is $\mathrm{Fix}(E_i)$-right-invariant and also $\mathrm{Fix}(\mathbb{R}e_1)$-left-invariant. Applying our results on $SO(n)$
and using the fact that the law of ${}^tU e_1$ under the Haar probability measure on $SO(n)$ is the uniform
distribution on the sphere recovers the main result of [5] which extends inequality (1): if $\sum_i c_i P_{E_i} \le \mathrm{Id}_{\mathbb{R}^n}$
then

$$\int_{S^{n-1}} \prod_{i=1}^m f_i^{c_i/2}\, d\sigma \le \prod_{i=1}^m \Big(\int_{S^{n-1}} f_i\, d\sigma\Big)^{c_i/2}.$$

Moreover, if $f : S^{n-1} \to \mathbb{R}^+$ is of the form $f(x) = g(|P_E x|)$, then the function $F : SO(n) \to \mathbb{R}^+$
defined by $F(U) = g(|P_E\, {}^tU e_1|)$ is $\mathrm{Stab}(E)$-right-invariant and $\mathrm{Fix}(\mathbb{R}e_1)$-left-invariant. This allows us
to transfer all of our $SO(n)$ results to the sphere.

Actually, a more general route is to note that BL-condition, in the form (| 1 2 [) . passes to quotient. 

Lemma 13. Let E be a Riemannian homogeneous space and G a compact Riemannian Lie group of 
isometries acting transitively on E. Assume we are in the situation of Theorem^ A function f : E — > R 
is said to be $G_i$-invariant if $f(g \cdot x) = f(x)$ for every $x \in E$ and $g \in G_i$. We can consider the associated 
$T_i : E \to E/G_i$ or more simply, with the notation 

$$\mathcal{E}_i(x) = \{\nabla f(x) \,;\ f : E \to \mathbb{R} \text{ is } G_i\text{-invariant}\}.$$

If condition (12) holds on $G$, then the BL-condition holds in $E$ in the equivalent form (10). 
Proof. Fix $x \in E$ and let $G_x = \{g \in G \,;\ g \cdot x = x\}$. Then, if we decompose the Lie algebra $\mathcal{G} = T_{\mathrm{Id}}G$ 
(equipped with its Euclidean structure) as an orthogonal sum $\mathcal{G} = \mathcal{G}_x \oplus \mathcal{G}_x^\perp$ where $\mathcal{G}_x$ is the Lie algebra 
associated to $G_x$, we have that $\mathcal{G}_x^\perp$ is isometric to $T_xE$ by the isometry map 

$$\pi = \pi_x : A \longmapsto \pi(A) = \frac{d}{dt}\Big|_{t=0} \exp(tA) \cdot x.$$

We see that n(Gi) C Siix) 1 - and therefore £i(x) C 7r(£,). Note that G x c Gi and £, c Since 
^7r(f*) — TtPsi tt~\ we get from (fT2j) that 

m 
z=l 

□ 

It is sometimes necessary to work directly on quotients, in particular for quotients of finite measure 
with a cover of infinite measure. We briefly discuss the example of the flat torus $(\mathbb{R}/\mathbb{Z})^n$. We consider 
for $i = 1, \dots, m$, rational vectors $u_i \in \mathbb{Q}^n$. For each $i$ let $\ell_i$ be the largest common divisor of the numbers 
$\langle u_i, e_1\rangle, \dots, \langle u_i, e_n\rangle$. In order to define the map $x \mapsto \langle x, u_i\rangle$ on the torus, one has to identify $\langle u_i, e_k\rangle$ 
to $0$ for all $k$. This amounts to quotienting $\mathbb{R}$ by $\sum_{k=1}^n \langle u_i, e_k\rangle \mathbb{Z} = \ell_i \mathbb{Z}$. Let $T_i : (\mathbb{R}/\mathbb{Z})^n \to \mathbb{R}/\ell_i\mathbb{Z}$ be the 
map defined by $T_i(x) = \langle x, u_i\rangle \bmod \ell_i$. One easily checks that the Laplacian commutes with $T_i$ (same 
calculation as in $\mathbb{R}^n$). Since for every $x$, $\nabla(f_i \circ T_i)(x)$ is a multiple of $u_i$, if $\sum_{i=1}^m c_i\, u_i \otimes u_i = \mathrm{Id}_{\mathbb{R}^n}$ it 
follows that 

/ l\M(x,u t )rdx<f]( f Mix^^dx) =f[( [ A ■ 

J(R/Z)» Y i=l V/P*/ 2 )" / f=l V^K/^ 2 / 

3.3 Dirichlet distributions and their relatives 

For $x \in \mathbb{R}^n$, we set $S(x) = x_1 + \cdots + x_n$. Let $\alpha \in (0, +\infty)^n$, then by definition the Dirichlet law $\widetilde{\mathcal{D}}_{n-1}(\alpha)$ 
is the distribution of 

$$\frac{(X_1, \dots, X_{n-1})}{X_1 + \cdots + X_n}$$

where $X_1, \dots, X_n$ are independent random variables such that for each $i$, $X_i$ is Gamma($\alpha_i$) distributed. 
More precisely it is supported on $T_{n-1} = \{y \in \mathbb{R}_+^{n-1} ;\ y_1 + \cdots + y_{n-1} \le 1\}$ and 

$$\widetilde{\mathcal{D}}_{n-1}(\alpha)(dy) = \frac{\Gamma(S(\alpha))}{\prod_{i \le n} \Gamma(\alpha_i)} \Big(\prod_{i \le n-1} y_i^{\alpha_i - 1}\Big) \Big(1 - \sum_{i \le n-1} y_i\Big)^{\alpha_n - 1} \mathbf{1}_{T_{n-1}}(y)\, dy.$$

In order to get more symmetric results, we prefer to work with another representation: we consider the 
law $\mathcal{D}_{n-1}(\alpha)$ of 

$$\frac{(X_1, \dots, X_n)}{X_1 + \cdots + X_n}.$$

It is supported on the regular simplex $\Delta_{n-1} = \{y \in \mathbb{R}_+^n ;\ y_1 + \cdots + y_n = 1\}$ and its density with 
respect to Lebesgue measure on $\Delta_{n-1}$ is proportional to $y \mapsto \prod_{i \le n} y_i^{\alpha_i - 1}$. Recall that some Dirichlet 
distributions are closely related to uniform spherical measures. Indeed if $G_i$ are independent variables 
with distribution $\exp(-t^2)\,dt/\sqrt{\pi}$, then the uniform measure on $S^{n-1}$ coincides with the law of 

$$\frac{(G_1, \dots, G_n)}{\sqrt{G_1^2 + \cdots + G_n^2}}.$$

Note that $G_i^2$ has distribution Gamma($1/2$). Write $N = k_1 + \cdots + k_n$. It is then clear that the image of 
the uniform probability on $S^{N-1}$ by the map 

$$x \mapsto \big(x_1^2 + \cdots + x_{k_1}^2,\ x_{k_1+1}^2 + \cdots + x_{k_1+k_2}^2,\ \dots,\ x_{k_1+\cdots+k_{n-1}+1}^2 + \cdots + x_N^2\big)$$

is $\mathcal{D}_{n-1}(k_1/2, \dots, k_n/2)$. This allows us to transfer some of our spherical results, but only to Dirichlet 
laws with half-integer coefficients. In order to deal with general coefficients the following direct study is 
needed. 

The measure $\widetilde{\mathcal{D}}_{n-1}(\alpha)$ is known (see [14], [19]) to be reversible and ergodic for the following 
Fleming-Viot operator 

$$\widetilde{L}_\alpha f = \sum_{i \le n-1} x_i\, \partial_{ii} f - \sum_{i,j \le n-1} x_i x_j\, \partial_{ij} f + \sum_{i \le n-1} (\alpha_i - S(\alpha) x_i)\, \partial_i f.$$

In the symmetric representation associated to $\mathcal{D}_{n-1}(\alpha)$, it is natural to consider the operator $L_\alpha$ defined 
for smooth functions $f : \mathbb{R}^n \to \mathbb{R}^+$ and for $x \in \Delta_{n-1}$ by 

$$L_\alpha f(x) = \sum_{i \le n} x_i\, \partial_{ii} f(x) - \sum_{i,j \le n} x_i x_j\, \partial_{ij} f(x) + \sum_{i \le n} (\alpha_i - S(\alpha) x_i)\, \partial_i f(x).$$

It is not hard to check that $L_\alpha f$ only depends on the restriction of $f$ to $\Delta_{n-1}$ (in the intrinsic formulation 
$\partial_i g$ is to be understood as $Dg \cdot P_H e_i = Dg \cdot (e_i - \mathbf{1}/n)$, where $\mathbf{1} = (1, \dots, 1) \in \mathbb{R}^n$ and $H = \mathbf{1}^\perp$). However 
it is convenient to be able to apply $L_\alpha$ to functions $f$ defined on the whole space. For example if we 
write $f(y) = g(y_1, \dots, y_{n-1})$, $y \in \Delta_{n-1}$, then it is clear that $L_\alpha f(y) = \widetilde{L}_\alpha g(y_1, \dots, y_{n-1})$; hence the 
properties of $\widetilde{L}_\alpha$ will pass to $L_\alpha$ (in particular $\mathcal{D}_{n-1}(\alpha)$ is reversible and ergodic for the semigroup 
generated by $L_\alpha$). 

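The carré du champ of $L_\alpha$ can be expressed in the following convenient form, for $x \in \Delta_{n-1}$: 

$$\begin{aligned}
\Gamma(f) &= \sum_{i \le n} x_i (\partial_i f)^2 - \sum_{i,j \le n} x_i x_j\, \partial_i f\, \partial_j f \\
&= \sum_{i \le n} x_i (\partial_i f)^2 - \Big(\sum_{i \le n} x_i\, \partial_i f\Big)^2 \\
&= \frac{1}{2} \sum_{i,j \le n} x_i x_j (\partial_i f - \partial_j f)^2,
\end{aligned}$$

where we have noted that $\Gamma(f)$ is actually a variance with respect to the probability measure $\sum_i x_i \delta_i$. 
The last formula comes from the representation $\mathrm{Var}(X) = \frac{1}{2} E((X - X')^2)$ where $X'$ is an independent 
copy of $X$. We are ready to establish 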

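Proposition 14. Let $\mathcal{I}$ be a collection of subsets of $\{1, \dots, n\}$. Assume that it is written as a disjoint 
union $\mathcal{I} = \mathcal{I}_1 \cup \mathcal{I}_2$. For each nonempty subset $I \in \mathcal{I}$, let $c_I > 0$, and $f_I : \Delta_{n-1} \to \mathbb{R}^+$ such that 

• if $I \in \mathcal{I}_1$ then for all $x$, $f_I(x)$ only depends on $(x_k)_{k \in I}$, 

• if $I \in \mathcal{I}_2$ then for all $x$, $f_I(x)$ only depends on $\sum_{k \in I} x_k$. 

If for all $1 \le i, j \le n$ with $i \ne j$ it holds: 

$$\sum_{I \in \mathcal{I}_1;\ I \cap \{i,j\} \ne \emptyset} c_I + \sum_{I \in \mathcal{I}_2;\ \mathrm{card}(I \cap \{i,j\}) = 1} c_I \le 1,$$

then the BL-condition (3) is satisfied and if $X$ is $\mathcal{D}_{n-1}(\alpha)$ distributed 

$$E\Big(\prod_{I \in \mathcal{I}} f_I^{c_I}(X)\Big) \le \prod_{I \in \mathcal{I}} \big(E f_I(X)\big)^{c_I}.$$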

Proof. First, we check the commutation relations. Since the coordinates play symmetric roles, we may 
assume that $I = \{1, \dots, k\}$. Also we may extend our functions to $\mathbb{R}^n$. If for all $x$, $g(x) = f(x_1 + \cdots + x_k)$ 
it is obvious that 

$$\partial_i g(x) = \begin{cases} 0 & \text{if } i > k \\ f'(x_1 + \cdots + x_k) & \text{if } i \le k \end{cases} \quad \text{and} \quad \partial^2_{i,j} g(x) = \begin{cases} 0 & \text{if } i \text{ or } j > k \\ f''(x_1 + \cdots + x_k) & \text{if } i, j \le k. \end{cases}$$

It is then clear that $L_\alpha g(x)$ is a function of $x_1 + \cdots + x_k$. Similarly, if $g(x) = h(x_1, \dots, x_k)$ then $L_\alpha g(x)$ 
is a function of $(x_i)_{i \le k}$. 

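Next, we have to check the analogue of Condition (7), namely 

$$\Gamma\Big(\sum_I c_I f_I\Big) \le \sum_I c_I\, \Gamma(f_I).$$

In view of the above expression of $\Gamma$, this amounts to show that for all $x \in \Delta_{n-1}$, 

$$\sum_{1 \le i \ne j \le n} x_i x_j \Big(\sum_I c_I\, \partial_i f_I - \sum_I c_I\, \partial_j f_I\Big)^2 \le \sum_I c_I \sum_{1 \le i \ne j \le n} x_i x_j (\partial_i f_I - \partial_j f_I)^2.$$

Hence it is sufficient to show that for all $i \ne j$, it holds 

$$\Big(\sum_I c_I (\partial_i f_I - \partial_j f_I)\Big)^2 \le \sum_I c_I (\partial_i f_I - \partial_j f_I)^2.$$

If $f_I$ only depends on $(x_k)_{k \in I}$ then $\partial_i f_I - \partial_j f_I = 0$ if $\{i, j\} \cap I = \emptyset$. Moreover if $f_I(x) = g(\sum_{k \in I} x_k)$ 
then $\partial_i f_I - \partial_j f_I = 0$ also if $\{i, j\} \subset I$. Hence the summations on $I$ actually only involve the sets $I \in \mathcal{I}_1$ 
such that $\{i, j\} \cap I \ne \emptyset$ and the sets $I \in \mathcal{I}_2$ such that $\mathrm{card}(\{i, j\} \cap I) = 1$. By hypothesis the sum of 
the corresponding coefficients $c_I$ is at most one, so that the required inequality is a mere consequence 
of the convexity of the square function. Hence Condition (7) holds true and we get the local inequality. 
By ergodicity the inequality passes to the measure $\mathcal{D}_{n-1}(\alpha)$. □ 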
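The hypothesis of Proposition 14 is a finite list of linear constraints on the exponents, one per pair $\{i, j\}$, so it can be checked mechanically. The following Python sketch does this with exact rational arithmetic; the function names and the encoding of the two families as dictionaries are illustrative choices, not notation from the text.

```python
from fractions import Fraction
from itertools import combinations

def pair_sums(n, family1, family2):
    """For each pair i < j in {1..n}, return the sum appearing in the hypothesis.

    family1 maps frozenset I -> c_I for functions of (x_k)_{k in I};
    family2 maps frozenset I -> c_I for functions of sum_{k in I} x_k.
    """
    sums = {}
    for i, j in combinations(range(1, n + 1), 2):
        pair = {i, j}
        # Sets of the first kind count as soon as they meet {i, j}.
        s = sum((c for I, c in family1.items() if I & pair), Fraction(0))
        # Sets of the second kind count only if they contain exactly one of i, j.
        s += sum((c for I, c in family2.items() if len(I & pair) == 1), Fraction(0))
        sums[(i, j)] = s
    return sums

def hypothesis_holds(n, family1, family2):
    """True if every pair sum is at most one."""
    return all(s <= 1 for s in pair_sums(n, family1, family2).values())

# Singletons with exponent 1/2: every pair {i, j} meets exactly two of them.
singletons = {frozenset({i}): Fraction(1, 2) for i in range(1, 5)}
print(hypothesis_holds(4, singletons, {}))
```

For singletons the two kinds of dependence coincide, which is why the same exponents $1/2$ are admissible whether the sets are placed in the first or in the second family.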

Let $p > 0$. Let $B_p^n = \{x \in \mathbb{R}^n ;\ \sum_i |x_i|^p \le 1\}$ be the unit ball for the $\ell_p$ norm on $\mathbb{R}^n$. On the 
corresponding unit sphere $\partial B_p^n = \{x \in \mathbb{R}^n ;\ \sum_i |x_i|^p = 1\}$, one often considers the cone measure $\mu_p^n$ 
defined by $\mu_p^n(A) = \mathrm{Vol}_n([0, 1] \cdot A)/\mathrm{Vol}_n(B_p^n)$, $A \subset \partial B_p^n$. Here $[0, 1] \cdot A$ is the intersection of $B_p^n$ with the 
cone of apex at the origin spanned by $A$. 

Corollary 15. Let $X$ be a random vector on $\mathbb{R}^n$. Assume that it is either uniformly distributed on $B_p^n$ 
or distributed according to the cone measure on $\partial B_p^n$. Then for all even functions $f_i : [-1, 1] \to \mathbb{R}^+$ 

$$E \prod_{i=1}^n f_i(X_i) \le \prod_{i=1}^n \big(E f_i(X_i)^2\big)^{1/2}.$$

Proof. This is deduced from a particular case of the previous result on Dirichlet distributions, which 
ensures that for $Y$ distributed according to $\mathcal{D}_{n-1}(\alpha)$, and $g_i : [0, 1] \to \mathbb{R}^+$, a similar inequality holds: 
$E \prod_i g_i(Y_i) \le \prod_i \big(E g_i(Y_i)^2\big)^{1/2}$. Indeed the uniform measure on $B_p^n$ and the cone measure on $\partial B_p^n$ can 
be viewed as symmetrized versions of the images of Dirichlet laws by maps of the form $T(x_1, \dots, x_n) = 
(T_1(x_1), \dots, T_n(x_n))$. Hence if we choose $g_i = f_i \circ T_i$ in the latter inequality, we get the claim. Let us make 
this strategy explicit in the case of the cone measure. Let $\varepsilon_i, G_i$, $i = 1, \dots, n$, be independent random 
variables. Assume that $\varepsilon_i$ is uniform on $\{-1, 1\}$ and $G_i$ distributed according to $e^{-t^p}\, dt/\Gamma(1 + 1/p)$, $t > 0$. 
Then it is known that the vector 

$$X = \frac{(\varepsilon_1 G_1, \dots, \varepsilon_n G_n)}{\big(\sum_i G_i^p\big)^{1/p}}$$

is distributed according to the cone measure. Hence $|X_i|^p = G_i^p/(G_1^p + \cdots + G_n^p)$ where $G_i^p$ is Gamma($1/p$)-distributed. 
So applying the Brascamp-Lieb inequality for $f_i(x) = g_i(|x|^p)$ yields the claim. 

A similar approach is possible for the uniform distribution on $B_p^n$ thanks to the representation 
provided in [B]. □ 



Remark 16. The cone measure on $\partial B_2^n$ is simply the uniform measure on $S^{n-1}$, for which a similar 
inequality holds for general functions $f_i$ (i.e. it is not necessary to assume that they are even). Hence 
one may ask whether the symmetry assumption in the previous corollary is really needed. In order to 
remove it one would need a result for symmetrized Dirichlet laws, namely for measures on $\partial B_1^n$ with 
density with respect to Lebesgue measure proportional to $\prod_i |x_i|^{\alpha_i - 1}$. At first sight, there does not seem 
to be any problem to extend our approach. However the ergodicity of these measures is a delicate issue. 
Indeed the fact that the density vanishes inside the domain may, in terms of the corresponding random 
process, create potential barriers that may not be crossed or potential wells into which the process may 
get stuck. On the technical level, the domain of the operator may be too small to contain enough 
non-symmetric functions. 

Remark 17. Proposition 14, and many results of this work, involve two kinds of functions which depend 
only on some coordinates $(x_k)_{k \in I}$ (some depend on all these coordinates and some depend on them only 
through their sum). It is possible to consider more general dependences. We have not tried to reach the 
highest generality in this respect. Let us briefly mention a quite general extension of Proposition 14: we 
could consider functions $f_I$ where $I = (I_1, \dots, I_k)$ is a collection of disjoint subsets of $\{1, \dots, n\}$, such 
that $f_I(x)$ only depends on 

$$T_I(x) := \Big(\sum_{i \in I_1} x_i, \dots, \sum_{i \in I_k} x_i\Big).$$

One can check that the map $T_I$ commutes with the Fleming-Viot operator (this uses the disjointness of 
$I_1, \dots, I_k$). If one considers now a collection of functions $(f_I)_{I \in \mathcal{I}}$ and corresponding coefficients $(c_I)_{I \in \mathcal{I}}$, 
then a Brascamp-Lieb inequality holds provided for all $i \ne j$ in $\{1, \dots, n\}$, $\sum_{I \in A_{i,j}} c_I \le 1$ where 

$$A_{i,j} = \{I \in \mathcal{I} ;\ \exists \ell,\ \mathrm{card}(I_\ell \cap \{i, j\}) = 1\}.$$

The proof follows the same arguments as the one of Proposition 14. We omit the details. Note that 
several results of this paper can be extended in an analogous way. 

4 Discrete models 

In this section, we deal with discrete models, and in particular we have to use the BL-condition in its 
brute form (3) since we are no longer working with diffusion generators. We nevertheless provide a simple 
criterion which can be worked out for a number of discrete models of interest. 

4.1 Abstract criterion 

Throughout this paragraph, $E$ will thus be a finite or countable state space. Let $K$ be a Markov kernel 
on $E$, that is, $K : E \times E \to [0, \infty)$ is such that for every $x \in E$, $\sum_{y \in E} K(x, y) = 1$. If $f : E \to \mathbb{R}$ is 
bounded, set $Kf(x) = \sum_{y \in E} K(x, y) f(y)$, $x \in E$. As before, for given maps $T_i : E \to E_i$, $i = 1, \dots, m$, 
we say they commute with $K$ if for any function $f : E_i \to \mathbb{R}$, $K(f \circ T_i)$ is a function of $T_i$. Again, this 
amounts to the existence of a Markov kernel $K_i$ on $E_i$ such that $K(f \circ T_i) = K_i(f) \circ T_i$. This definition 
is of course equivalent to the abstract one given earlier in terms of the associated Markov generator 

$$L = K - \mathrm{Id}.$$

The next proposition provides a simple equivalent criterion for the BL-condition (3) in this context. 

Proposition 18 (BL-condition in the discrete case). For distinct $x, y \in E$ such that $K(x, y) > 0$, set 

$$I_{x,y} = \{i \in \{1, \dots, m\} ;\ T_i(x) \ne T_i(y)\}.$$

Let $c_i > 0$, $i = 1, \dots, m$. Then the BL-condition (3) holds if and only if 

$$\sum_{i \in I_{x,y}} c_i \le 1 \quad \text{for all } x \ne y \text{ in } E \text{ such that } K(x, y) > 0. \qquad (15)$$

Therefore, under this condition, for every non-negative functions $f_i : E_i \to \mathbb{R}$, $i = 1, \dots, m$, and every 
$t > 0$, 

$$P_t\Big(\prod_{i=1}^m f_i^{c_i} \circ T_i\Big) \le \prod_{i=1}^m \big(P_t(f_i \circ T_i)\big)^{c_i}.$$

In particular, if $K$ has an ergodic invariant probability measure $\mu$ and if for all $x, y \in E$ distinct with 
$K(x, y) > 0$, it holds $\mathrm{card}\{i = 1, \dots, m ;\ T_i(x) \ne T_i(y)\} \le p$, then choosing $c_i = \frac{1}{p}$, $i = 1, \dots, m$, we 
have that 

$$\int \prod_{i=1}^m f_i \circ T_i\, d\mu \le \prod_{i=1}^m \Big(\int f_i^p \circ T_i\, d\mu\Big)^{1/p}.$$

Proof. At fixed $x \in E$, condition (3) may be written as 

$$\sum_{y \in E} K(x, y)\Big(e^{\sum_{i=1}^m c_i [f_i \circ T_i(y) - f_i \circ T_i(x)]} - 1\Big) \le \sum_{i=1}^m c_i \sum_{y \in E} K(x, y)\Big(e^{f_i \circ T_i(y) - f_i \circ T_i(x)} - 1\Big). \qquad (16)$$

The sums over $i$ on both sides only run over $i \in I_{x,y}$ so that the preceding inequality is equivalent to 
saying that 

$$\sum_{y \in E} K(x, y)\, \varphi\Big(\sum_{i \in I_{x,y}} c_i [f_i \circ T_i(y) - f_i \circ T_i(x)]\Big) \le \sum_{y \in E} K(x, y) \sum_{i \in I_{x,y}} c_i\, \varphi\big(f_i \circ T_i(y) - f_i \circ T_i(x)\big)$$

where $\varphi(u) = e^u - 1$. Since $\varphi(0) = 0$, we can restrict the previous sum over $y \in E \setminus \{x\}$, and of course 
we can ask that $K(x, y) \ne 0$. Now, for fixed $x, y \in E$ with $x \ne y$ and $K(x, y) \ne 0$, we argue that the 
Condition (15) on the $c_i$'s combines with the convexity of $\varphi$ to give (pointwise) the desired inequality. 

Conversely, if (16) holds for all choices of $f_i$, $i = 1, \dots, m$, we choose $f_i(z) = \theta\, \mathbf{1}_{z \ne T_i(x)}$ where 
$\theta \in \mathbb{R}^+$. Letting $\theta \to +\infty$ and comparing the orders of the terms in (16) shows that for each $y \ne x$ with 
$K(x, y) > 0$, we must have $\sum_{i \in I_{x,y}} c_i \le 1$. □ 
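To make the criterion concrete, here is a small Python sketch that tests condition (15) on a finite state space and evaluates both sides of (16) at a point. The toy kernel on $\{0,1\}^2$ (one uniformly chosen coordinate is flipped), the maps and the test functions are hypothetical choices made for illustration only.

```python
import math
from itertools import product

def condition_15(states, K, maps, c):
    """True if sum_{i in I_{x,y}} c_i <= 1 for all x != y with K(x, y) > 0."""
    for x in states:
        for y in states:
            if x != y and K(x, y) > 0:
                if sum(ci for T, ci in zip(maps, c) if T(x) != T(y)) > 1:
                    return False
    return True

def sides_of_16(x, states, K, maps, c, fs):
    """Left- and right-hand sides of (16) at the point x."""
    lhs = sum(
        K(x, y) * (math.exp(sum(ci * (f(T(y)) - f(T(x)))
                                for T, ci, f in zip(maps, c, fs))) - 1.0)
        for y in states)
    rhs = sum(
        ci * sum(K(x, y) * (math.exp(f(T(y)) - f(T(x))) - 1.0) for y in states)
        for T, ci, f in zip(maps, c, fs))
    return lhs, rhs

# Toy example (assumption): E = {0,1}^2, one uniformly chosen coordinate is flipped.
states = list(product((0, 1), repeat=2))

def K(x, y):
    return 0.5 if sum(a != b for a, b in zip(x, y)) == 1 else 0.0

maps = [lambda x: x[0], lambda x: x[1]]          # the two coordinate maps
fs = [lambda t: 1.3 * t, lambda t: -0.7 * t]     # arbitrary test functions

print(condition_15(states, K, maps, [1, 1]))
print(sides_of_16((0, 0), states, K, maps, [0.5, 0.5], fs))
```

Each move of this kernel changes exactly one coordinate, so every $I_{x,y}$ is a singleton and any exponents $c_i \le 1$ are admissible. Adding the identity map as a third $T_i$ enlarges each $I_{x,y}$ and tightens the constraint accordingly.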

Remark 19 (Extension to non-finite settings). The careful reader has probably noticed that the finiteness 
(or countability) of $E$ is not central in the argument. The whole argument works as soon as we can express 
$L + \mathrm{Id} =: K$ in terms of a Markov kernel. Indeed, this allows us to reduce the problem to a pointwise 
inequality. 

We next illustrate instances of the preceding result. 



4.2 Examples 

4.2.1 Homomorphisms of finitely generated groups 

Let for example $G$, $G_i$, $i = 1, \dots, m$, be finite or countable groups and $T_i : G \to G_i$ be homomorphisms. 
Let $K$ be a Markov kernel on $G$ which is left-invariant in the sense that $K(gx, gy) = K(x, y)$ for all 
$x, y, g \in G$. It is then clear that each $T_i$ commutes with $K$. 

We may let for example $G$ be finitely generated with generating set $S$, and $K(x, y) = \mathrm{card}(S)^{-1} \mathbf{1}_S(y^{-1} x)$, 
$x, y \in G$. Then, condition (15) of Proposition 18 amounts to 

$$\sum_{i \in I_z} c_i \le 1$$

for every $z \in S$ where $I_z = \{i = 1, \dots, m ;\ z \notin \mathrm{Ker}(T_i)\}$. 



4.2.2 Coordinates of the symmetric group 

Let $E$ be the symmetric group $\mathcal{S}_n$ over $n$ elements $\{1, \dots, n\}$, $n \ge 2$. This set is the discrete analogue 
of $SO(n)$. Unlike the continuous setting, there are several possible choices for the kernel $K$. However in 
view of the latter proposition, where each couple $(x, y)$ with $K(x, y) > 0$ leads to a linear constraint on 
the exponents $c_i$, it is natural to take a small (or even minimal) generating set $S$ and to consider: 

$$K(x, y) = \frac{1}{\mathrm{card}(S)} \quad \text{if there is } \tau \in S \text{ with } y = \tau x.$$

We choose for $S$ the set of all transpositions. The following calculation will show that it is the best 
choice, since it minimizes the size of the support $\mathrm{supp}(\tau) = \{j ;\ \tau(j) \ne j\}$. 
The normalized counting measure $\mu$ is invariant for $K$. Actually $S$ being stable by inverse it is also 
reversible: 

$$\int (Kf)\, g\, d\mu = \int \frac{1}{\mathrm{card}(S)} \sum_{\tau \in S} f(\tau x)\, g(x)\, d\mu(x) = \int \frac{1}{\mathrm{card}(S)} \sum_{\tau \in S} f(y)\, g(\tau^{-1} y)\, d\mu(y) = \int (Kg)\, f\, d\mu.$$

Let $I$ be a subset of $\{1, \dots, n\}$. We consider the map $T_I$ defined by 

$$T_I(x) = x_{|I} = (x(i))_{i \in I}, \quad \forall x \in \mathcal{S}_n.$$

Then $T_I$ commutes with $K$; indeed 

$$K(f \circ T_I)(x) = \frac{1}{\mathrm{card}(S)} \sum_{\tau \in S} (f \circ T_I)(\tau x)$$

and $T_I(\tau x) = (\tau \circ x)_{|I} = \tau \circ x_{|I}$ depends only on $T_I(x)$. The result of Proposition 18 involves the 
condition $T_I(x) \ne T_I(y)$ for $K(x, y) > 0$. Let us formulate it in a more concrete manner: 

$$T_I(x) \ne T_I(\tau x) \iff \exists i \in I,\ x(i) \ne \tau x(i) \iff I \cap x^{-1}(\mathrm{supp}(\tau)) \ne \emptyset.$$

Note that since the proposition involves this condition for all $x \in \mathcal{S}_n$, the set $x^{-1}(\mathrm{supp}(\tau))$ can be any set 
with the size of the support of $\tau$. Choosing transpositions then clearly appears as the most economical 
choice. 

For $I \subset \{1, \dots, n\}$ we may also consider the map $R_I$ defined by 

$$R_I(x) = x(I) = \{x(i),\ i \in I\}, \quad \forall x \in \mathcal{S}_n.$$

Then $R_I$ also commutes with $K$ and for any $x$ and any transposition $\tau$, $R_I(x) \ne R_I(\tau x)$ happens if and 
only if $\tau$ moves one point in $x(I)$ outside $x(I)$. Hence 

$$R_I(x) \ne R_I(\tau x) \iff \mathrm{card}(I \cap x^{-1}(\mathrm{supp}(\tau))) = 1.$$
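Both equivalences above are finite statements and can be confirmed by exhaustive enumeration for small $n$. The Python sketch below does so; permutations are encoded as tuples with `x[i-1]` standing for $x(i)$, which is a convention chosen here for convenience.

```python
from itertools import combinations, permutations

def check_symmetric_group(n):
    """Brute-force check of the two equivalences on S_n for all transpositions."""
    points = range(1, n + 1)
    subsets = [set(I) for r in range(1, n + 1) for I in combinations(points, r)]
    for x in permutations(points):              # x[i-1] is x(i)
        for a, b in combinations(points, 2):    # the transposition tau = (a b)
            swap = {a: b, b: a}
            tx = tuple(swap.get(v, v) for v in x)               # tau o x
            moved = {i for i in points if x[i - 1] in (a, b)}   # x^{-1}(supp tau)
            for I in subsets:
                # T_I compares the restrictions to I as functions.
                T_differs = any(x[i - 1] != tx[i - 1] for i in I)
                # R_I compares the images of I as sets.
                R_differs = {x[i - 1] for i in I} != {tx[i - 1] for i in I}
                if T_differs != bool(I & moved):
                    return False
                if R_differs != (len(I & moved) == 1):
                    return False
    return True

print(check_symmetric_group(4))
```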

Combining these observations with Proposition fT51 yields a discrete analogue to Proposition fTUl 

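Proposition 20. Let $\mathcal{I}$ be a collection of subsets of $\{1, \dots, n\}$. Assume that it is written as a disjoint 
union $\mathcal{I} = \mathcal{I}_1 \cup \mathcal{I}_2$. For each nonempty subset $I \in \mathcal{I}$, let $c_I > 0$ and $f_I : \mathcal{S}_n \to \mathbb{R}^+$ such that 

• if $I \in \mathcal{I}_1$ then for all $x$, $f_I(x)$ only depends on $x_{|I}$, 

• if $I \in \mathcal{I}_2$ then for all $x$, $f_I(x)$ only depends on $x(I)$. 

If for all $1 \le i, j \le n$ with $i \ne j$ it holds: 

$$\sum_{I \in \mathcal{I}_1;\ I \cap \{i,j\} \ne \emptyset} c_I + \sum_{I \in \mathcal{I}_2;\ \mathrm{card}(I \cap \{i,j\}) = 1} c_I \le 1,$$

then the BL-condition (3) is satisfied and 

$$\int \prod_{I \in \mathcal{I}} f_I^{c_I}\, d\mu \le \prod_{I \in \mathcal{I}} \Big(\int f_I\, d\mu\Big)^{c_I}.$$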

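The examples given after Proposition ITOl transfer to $\mathcal{S}_n$. For a family $\mathcal{I}$ of subsets of $\{1, \dots, n\}$, 
introduce the exponents: 

$$p = \max_{i \ne j} \mathrm{card}(\{I \in \mathcal{I} ;\ i \in I \text{ or } j \in I\}) \quad \text{and} \quad q = \max_{i \ne j} \mathrm{card}(\{I \in \mathcal{I} ;\ \mathrm{card}(I \cap \{i, j\}) = 1\}).$$

Then, for functions $g_I$ and $h_I$ defined on suitable sets, we have 

$$\int_{\mathcal{S}_n} \prod_{I \in \mathcal{I}} g_I(\sigma_{|I})\, d\mu(\sigma) \le \prod_{I \in \mathcal{I}} \Big(\int_{\mathcal{S}_n} g_I(\sigma_{|I})^p\, d\mu(\sigma)\Big)^{1/p},$$

$$\int_{\mathcal{S}_n} \prod_{I \in \mathcal{I}} h_I(\sigma(I))\, d\mu(\sigma) \le \prod_{I \in \mathcal{I}} \Big(\int_{\mathcal{S}_n} h_I(\sigma(I))^q\, d\mu(\sigma)\Big)^{1/q}.$$

A particular case of interest (where these two cases coincide) is when $\mathcal{I} = \{\{1\}, \dots, \{n\}\}$. Then, 
$p = q = 2$ and we recover the inequality on permanents given in [12]. 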
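For singletons the inequality can be tested exactly on small matrices. Writing $F_{ij} = f_i(j)$, one has $\int \prod_i f_i(\sigma(i))\, d\mu = \mathrm{per}(F)/n!$ and $\int f_i(\sigma(i))^2\, d\mu = \frac{1}{n} \sum_j F_{ij}^2$, so the case $p = 2$ reads $(\mathrm{per}(F)/n!)^2 \le \prod_i \frac{1}{n} \sum_j F_{ij}^2$. The Python sketch below computes $p$, $q$ and checks this squared form with rational arithmetic; the function names and the sample matrix are illustrative assumptions.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial

def exponents_p_q(n, family):
    """The exponents p and q attached to a family of subsets of {1..n}."""
    pairs = list(combinations(range(1, n + 1), 2))
    p = max(sum(1 for I in family if I & {i, j}) for i, j in pairs)
    q = max(sum(1 for I in family if len(I & {i, j}) == 1) for i, j in pairs)
    return p, q

def permanent(F):
    """Permanent of a square matrix by direct summation over permutations."""
    n = len(F)
    total = 0
    for s in permutations(range(n)):
        term = 1
        for i in range(n):
            term *= F[i][s[i]]
        total += term
    return total

def permanent_inequality_holds(F):
    """Check (per(F)/n!)^2 <= prod_i (1/n) sum_j F[i][j]^2 for non-negative integer entries."""
    n = len(F)
    lhs = Fraction(permanent(F), factorial(n)) ** 2
    rhs = Fraction(1)
    for row in F:
        rhs *= Fraction(sum(v * v for v in row), n)
    return lhs <= rhs

print(exponents_p_q(4, [frozenset({i}) for i in range(1, 5)]))
print(permanent_inequality_holds([[1, 2, 3], [4, 5, 6], [7, 8, 10]]))
```

Comparing squares avoids square roots and keeps the check exact; this is legitimate because both sides are non-negative.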
4.2.3 Slices of the discrete cube and multivariate hypergeometric distributions 

For $n > k > 0$, let 

$$\Omega_{n,k} = \{x \in \{0, 1\}^n ;\ x_1 + \cdots + x_n = k\}$$

equipped with uniform measure. These sets are discrete analogues of the sphere $S^{n-1}$. Two elements $x, y$ 
in $\Omega_{n,k}$ are neighbors if and only if they differ on exactly two coordinates, a relation written as $x \sim y$. 
Let $K$ be the nearest neighbor random walk on $\Omega_{n,k}$ (known as the Bernoulli-Laplace model) defined by 

$$Kf(x) = \frac{1}{k(n-k)} \sum_{y \sim x} f(y).$$

It is easy to check that $Kf(x)$ only depends on the $i$-th coordinate $x_i$ of $x$ if this is the case for $f$. Indeed, 
the number of neighbors $y$ of $x$ such that $y_i = x_i$ is equal to $(k - x_i)(n - 1 - k + x_i)$, whereas when 
$y_i = 1 - x_i$, this number is equal to the number of coordinates $x_j$, $j \ne i$, such that $x_j = 1 - x_i$. For the 
coordinate maps $T_i(x) = x_i$, $1 \le i \le n$, we are thus in the preceding setting of commuting operators so 
that Proposition 18 applies with $p = 2$. 
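The two neighbor counts used in this argument can be confirmed by enumeration. The Python sketch below lists the neighbors of every point of a slice and compares the counts with the stated formulas; coordinates are indexed from 0 in the code, and the function names are illustrative.

```python
from itertools import combinations

def slice_points(n, k):
    """All points of {0,1}^n with exactly k coordinates equal to 1."""
    return [tuple(1 if i in ones else 0 for i in range(n))
            for ones in map(set, combinations(range(n), k))]

def neighbours(x):
    """Points obtained from x by exchanging one coordinate equal to 1 with one equal to 0."""
    out = []
    for i, xi in enumerate(x):
        for j, xj in enumerate(x):
            if xi == 1 and xj == 0:
                y = list(x)
                y[i], y[j] = 0, 1
                out.append(tuple(y))
    return out

def check_counts(n, k):
    """Compare the neighbor counts with the formulas of the text, for 0 < k < n."""
    for x in slice_points(n, k):
        nb = neighbours(x)
        if len(nb) != k * (n - k):
            return False
        for i in range(n):
            same = sum(1 for y in nb if y[i] == x[i])
            flipped = sum(1 for y in nb if y[i] != x[i])
            if same != (k - x[i]) * (n - 1 - k + x[i]):
                return False
            if flipped != sum(1 for j in range(n) if j != i and x[j] == 1 - x[i]):
                return False
    return True

print(check_counts(5, 2))
```

Since both counts depend on $x$ only through $x_i$, the value of $K(f \circ T_i)(x)$ is indeed a function of $x_i$ alone.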

Alternatively one can use the following observation, which was pointed out to us by P. Caputo. The 
uniform probability measure on $\Omega_{n,k}$ is the image of the uniform probability measure on the permutation 
group $\mathcal{S}_n$ by the map $x \in \mathcal{S}_n \mapsto (\mathbf{1}_{x(i) \le k})_{1 \le i \le n}$. Consequently the correlation inequalities derived on $\mathcal{S}_n$ 
for functions depending on blocks of coordinates pass to $\Omega_{n,k}$ to yield the same result. Such a reasoning 
may be extended in order to encompass more general distributions. Consider integer numbers $K \le M$ 
and $m = (m_i)_{1 \le i \le n}$ such that $\sum_i m_i = M$. The multivariate hypergeometric distribution $\mathcal{H}(m, K)$ is 
defined on $\mathbb{N}^n$ by 

$$\mathcal{H}(m, K)(\{(k_1, \dots, k_n)\}) = \frac{\prod_{i=1}^n \binom{m_i}{k_i}}{\binom{M}{K}}$$

if $k_1 + \cdots + k_n = K$ and for all $i$, $k_i \le m_i$ and $\mathcal{H}(m, K)(\{(k_1, \dots, k_n)\}) = 0$ otherwise. Given an urn 
containing $M$ balls of $n$ different colors, and more precisely $m_i$ of the $i$-th color, if one draws $K$ balls 
(uniformly) at random then the $n$-tuple $(X_1, \dots, X_n)$ consisting of the numbers of balls of each color in 
the sample is $\mathcal{H}(m, K)$-distributed. It is not hard to check that $\mathcal{H}(m, K)$ coincides with the image of the 
uniform probability law on the permutation group $\mathcal{S}_M$ by the map 



This observation can be used to show that Proposition 14 remains valid if one replaces the Dirichlet 
laws by multivariate hypergeometric distributions. We only outline the proof. Starting from functions 
$f_I$ defined on the support of $\mathcal{H}(m, K)$, we consider the functions $g_I := f_I \circ T$. Note that $g_I(\sigma)$ depends 
on the images by $\sigma$ of several intervals of $\{1, \dots, M\}$. Applying Proposition 20 directly would not give 
the right result, since it only deals with simpler forms of dependences. Hence we need to go back to 
Proposition 18 in the spirit of the proof of Proposition 20 (this is actually related to Remark 17). We 
omit the details. 
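The representation of the hypergeometric law as an image of the uniform measure on a permutation group can be verified exactly for small parameters. The explicit formula of the map did not survive in the text as reproduced here, so the Python sketch below uses one natural candidate, stated as an assumption: split $\{1, \dots, M\}$ into consecutive blocks $J_1, \dots, J_n$ of sizes $m_1, \dots, m_n$ and send $\sigma$ to $(\mathrm{card}\{j \in J_i ;\ \sigma(j) \le K\})_{i \le n}$.

```python
from collections import Counter
from fractions import Fraction
from itertools import permutations
from math import comb, factorial

def hypergeometric_pmf(m, K, ks):
    """Mass of (k_1, ..., k_n) under the multivariate hypergeometric law H(m, K)."""
    M = sum(m)
    if sum(ks) != K or any(k > mi for k, mi in zip(ks, m)):
        return Fraction(0)
    num = 1
    for mi, k in zip(m, ks):
        num *= comb(mi, k)
    return Fraction(num, comb(M, K))

def image_of_uniform_permutation(m, K):
    """Law of sigma -> (#{j in J_i : sigma(j) <= K})_i for sigma uniform on S_M,
    where J_1, J_2, ... are consecutive blocks of sizes m_1, m_2, ... (assumed map)."""
    M = sum(m)
    blocks, start = [], 0
    for mi in m:
        blocks.append(range(start, start + mi))
        start += mi
    counts = Counter()
    for sigma in permutations(range(1, M + 1)):   # sigma[j] is the image of position j
        counts[tuple(sum(1 for j in B if sigma[j] <= K) for B in blocks)] += 1
    return {ks: Fraction(c, factorial(M)) for ks, c in counts.items()}

law = image_of_uniform_permutation((2, 1, 2), 2)
print(all(p == hypergeometric_pmf((2, 1, 2), 2, ks) for ks, p in law.items()))
```

With all $m_i = 1$ and $K = k$ the assumed map reduces to $\sigma \mapsto (\mathbf{1}_{\sigma(i) \le k})_i$, which is the map used for the slices of the cube.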

4.2.4 Product spaces and Finner's theorem 

Let us go back to more general distributions (including continuous distributions on non-finite spaces) 
but in the context of product structures. The hypotheses in Propositions [TU1 [2TJ1 or [T51 are reminiscent of 
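Finner's theorem [15] which expresses that if $E = X_1 \times \cdots \times X_n$ is a product space with product probability 
measure $\mu = \nu_1 \otimes \cdots \otimes \nu_n$, and if, for $i = 1, \dots, m$, $T_i : E \to E_i$ is the coordinate projection on the 
space $E_i := \prod_{j \in S_i} X_j$ determined by $S_i \subset \{1, \dots, n\}$, then for any non-negative functions $f_i : E_i \to \mathbb{R}$, 
$i = 1, \dots, m$, 

$$\int \prod_{i=1}^m f_i^{c_i} \circ T_i\, d\mu \le \prod_{i=1}^m \Big(\int f_i \circ T_i\, d\mu\Big)^{c_i}$$

provided that 

$$\sum_{i ;\ j \in S_i} c_i \le 1 \quad \text{for every } j = 1, \dots, n.$$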
This statement is actually contained in Proposition 18 for a suitable choice of the kernel $K$. Without 
loss of generality, we may assume that, for each $i$, $X_i$ is a finite set equipped with a probability measure 
$\nu_i$ that charges all points. Consider the kernels $K_i$ on $X_i$ given by $K_i(x_i, y_i) = \nu_i(y_i)$, and tensorize them 
to the product space $E = X_1 \times \cdots \times X_n$ by 

$$K = \frac{1}{n} \sum_{i=1}^n I \otimes \cdots \otimes I \otimes K_i \otimes I \otimes \cdots \otimes I$$

where $I$ is defined on $X_j$ by $I(x_j, y_j) = \mathbf{1}_{x_j = y_j}$ (in other words, the associated Markov operator is the 
identity). The commutation property of the projection operators $T_i$ is obvious. Moreover, for distinct 
elements $x, y$ in $E$, $K(x, y) > 0$ if and only if $x = (x_1, \dots, x_n)$ and $y = (y_1, \dots, y_n)$ differ at exactly one 
coordinate, say $j$. Now the set of $i$'s such that $T_i(x) \ne T_i(y)$ is exactly the set of $i$'s such that $S_i \ni j$. 

In particular, the preceding kernel provides a proof of the classical Hölder inequality on the finite 
space $X$ equipped with the probability measure $\nu$, and by approximation on any finite measure space. 
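The reduction to Proposition 18 rests on identifying the index sets $I_{x,y}$ for the tensorized kernel. A brief Python sketch of this bookkeeping follows; coordinates are indexed from 0, and the Loomis-Whitney type family of projections used in the example is a hypothetical choice.

```python
from fractions import Fraction
from itertools import product

def finner_condition(n, S, c):
    """The sum of c_i over the i with j in S_i is at most 1, for every coordinate j."""
    return all(sum(ci for Si, ci in zip(S, c) if j in Si) <= 1 for j in range(n))

def index_set(x, y, S):
    """I_{x,y}: the indices i such that the projections of x and y onto S_i differ."""
    return {i for i, Si in enumerate(S) if any(x[j] != y[j] for j in Si)}

def one_coordinate_moves_match(n, size, S):
    """For x, y differing at exactly one coordinate j, check I_{x,y} = {i : j in S_i}."""
    for x in product(range(size), repeat=n):
        for j in range(n):
            for v in range(size):
                if v != x[j]:
                    y = x[:j] + (v,) + x[j + 1:]
                    if index_set(x, y, S) != {i for i, Si in enumerate(S) if j in Si}:
                        return False
    return True

# Projections onto pairs of coordinates in dimension 3, with exponents 1/2.
S = [{1, 2}, {0, 2}, {0, 1}]
print(finner_condition(3, S, [Fraction(1, 2)] * 3))
print(one_coordinate_moves_match(3, 2, S))
```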



5 Sums of squares 

In this short paragraph, we briefly illustrate how the ideas developed in the preceding discrete setting 
may also be of interest for classes of diffusion generators. Assume the generator $L$ is a sum of squares of 
vector fields on a manifold $E$, 

$$L = \sum_\ell X_\ell^2.$$

Let for example $T_i : E \to \mathbb{R}^{k_i}$, $i = 1, \dots, m$, be commuting (with $L$) maps. We interpret $X_\ell T_i$ coordinate 
by coordinate. The criterion put forward in Proposition 18 then adapts to this setting: 

Proposition 21. For every $\ell$, let $I_\ell := \{i \in \{1, \dots, m\} ;\ X_\ell T_i \ne 0\}$. Let $c_i > 0$, $i = 1, \dots, m$, be such 
that 

$$\sum_{i \in I_\ell} c_i \le 1 \quad \text{for every } \ell.$$

Then, for every non-negative functions $f_i : \mathbb{R}^{k_i} \to \mathbb{R}$, $i = 1, \dots, m$, and every $t > 0$, 

$$P_t\Big(\prod_{i=1}^m f_i^{c_i} \circ T_i\Big) \le \prod_{i=1}^m \big(P_t(f_i \circ T_i)\big)^{c_i}.$$

In particular, if for all $\ell$, $\mathrm{card}\{i = 1, \dots, m ;\ X_\ell T_i \ne 0\} \le p$, we may choose $c_i = \frac{1}{p}$, $i = 1, \dots, m$. 
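Proof. Since $\Gamma(f) = \sum_\ell (X_\ell f)^2$, according to the Fact stated earlier for diffusion generators, the BL-condition (3) takes the form 

$$\sum_\ell (X_\ell H)^2 \le \sum_{i=1}^m c_i \sum_\ell \big(X_\ell(f_i \circ T_i)\big)^2, \qquad (17)$$

where we recall that $H = \sum_{i=1}^m c_i\, f_i \circ T_i$. Hence we are done if we can prove that for every $\ell$, 

$$\Big(\sum_{i=1}^m c_i\, X_\ell(f_i \circ T_i)\Big)^2 \le \sum_{i=1}^m c_i \big(X_\ell(f_i \circ T_i)\big)^2. \qquad (18)$$

If $f_i$ is a function on $\mathbb{R}^{k_i}$, then $X_\ell(f_i \circ T_i) = \langle X_\ell T_i, \nabla f_i(T_i)\rangle$ is zero when $i \notin I_\ell$. Hence the summations 
in the above inequality only run over $i \in I_\ell$. Since, by hypothesis, $\sum_{i \in I_\ell} c_i \le 1$, Inequality (18) is valid 
by convexity of the square function. The conclusion follows. □ 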

We illustrate this result in the context of the Loomis-Whitney inequalities on the sphere. Consider 

$$\Delta = \frac{1}{2} \sum_{k,\ell} X_{k,\ell}^2 = \frac{1}{2} \sum_{k,\ell} [x_k \partial_\ell - x_\ell \partial_k]^2$$

the Laplace operator on the sphere $S^{n-1} \subset \mathbb{R}^n$. Let $A$ be a subset of $\{1, \dots, n\}$ with $d$ elements, and 
consider $T_A : \mathbb{R}^n \to \mathbb{R}^d$ defined by $T_A(x) = (x_i)_{i \in A}$. Then $X_{k,\ell} T_A = 0$ if and only if $\{k, \ell\} \cap A = \emptyset$. Thus, 
for every $k, \ell$, 

$$p = \mathrm{card}\{A ;\ |A| = d,\ X_{k,\ell} T_A \ne 0\}$$




One instance of application is $d = 1$ (for which $p = 2$) from which we recover inequality (1) involving 
functions of $T_i(x) = x_i$. The approach here is indeed very close to the one of Carlen, Lieb and Loss [11]. 

Remark 22. This viewpoint best explains the analogy between the results on $SO(n)$ and $\mathcal{S}_n$. Indeed the 
infinitesimal rotation $x_k \partial_\ell - x_\ell \partial_k$ in $\mathrm{vect}(e_k, e_\ell)$ is the analogue of the transposition $\tau_{k,\ell}$. 
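The exponent $p$ is a purely combinatorial quantity: it counts the $d$-element subsets $A$ that meet $\{k, \ell\}$. An elementary count (not taken from the text as reproduced here) gives $\binom{n}{d} - \binom{n-2}{d}$, which equals $2$ when $d = 1$. The Python sketch below confirms this by enumeration; the function name is illustrative.

```python
from itertools import combinations
from math import comb

def count_moved_subsets(n, d, k, l):
    """Number of d-subsets A of {1..n} meeting {k, l}, i.e. with X_{k,l} T_A != 0."""
    return sum(1 for A in combinations(range(1, n + 1), d) if k in A or l in A)

print(count_moved_subsets(5, 1, 1, 2))
print(count_moved_subsets(5, 2, 1, 2), comb(5, 2) - comb(3, 2))
```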



6 Entropy of marginals 

In this section, we investigate, from the abstract Markov operator point of view, descriptions of the 
Brascamp-Lieb inequalities and entropy inequalities for marginals following [11, 10]. As before, we 
do not make precise the classes of functions under consideration. 

Let $(E, \mu)$ be a probability space and $T_i : E \to E_i$ be measurable maps. Given a probability density 
$f$ on $E$ with respect to $\mu$, denote by $f_i$ its conditional expectation with respect to $T_i$. In other words, 
$f_i$ is the unique probability density on $E$ with respect to $\mu$ such that, for every bounded measurable 
$\varphi : E_i \to \mathbb{R}$, 

$$\int f\, \varphi \circ T_i\, d\mu = \int f_i\, \varphi \circ T_i\, d\mu. \qquad (19)$$

(Since $f_i = h_i \circ T_i$ for some $h_i : E_i \to \mathbb{R}$, $h_i$ may be thought of as the "marginal" of $f$ in the direction of 
$T_i$.) As shown in [11], the Brascamp-Lieb inequality may be used, by standard arguments, to prove 
the entropy inequality for the probability density $f$ 

$$\sum_{i=1}^m c_i \int f_i \log f_i\, d\mu \le \int f \log f\, d\mu. \qquad (20)$$

A recent work by Carlen and Cordero-Erausquin [10] shows that there is a full equivalence: 

Proposition 23. The following are equivalent. 

(i) For every non-negative functions $g_i : E_i \to \mathbb{R}$, $i = 1, \dots, m$, 

$$\int \prod_{i=1}^m g_i^{c_i} \circ T_i\, d\mu \le \prod_{i=1}^m \Big(\int g_i \circ T_i\, d\mu\Big)^{c_i}.$$

(ii) For every probability density $f$ with respect to $\mu$, 

$$\int f \log f\, d\mu \ge \sum_{i=1}^m c_i \int f_i \log f_i\, d\mu.$$

Since semigroup proofs are available for Brascamp-Lieb inequalities, it is natural to hope for semigroup 
proofs of entropy inequalities. Such an approach was suggested in [5] for spherical measures, on the basis 
of the corresponding inequality for the Fisher information. In the remainder of this section, we discuss 
the extension of this argument to the abstract framework. 

Let $L$ be a Markov generator on $E$ with semigroup $(P_t)_{t \ge 0}$. We require that $L$ be invariant, symmetric 
and ergodic for $\mu$. Denote by $\Gamma$ the carré du champ operator of $L$ as defined earlier. Hence, the Dirichlet 
form is expressed as follows 

$$\mathcal{E}(f, g) = \int \Gamma(f, g)\, d\mu = -\int f\, Lg\, d\mu = -\int g\, Lf\, d\mu.$$
It is classical that, under suitable domain assumptions, 

$$\int f \log f\, d\mu = \int_0^\infty dt \int \Gamma(P_t f, \log P_t f)\, d\mu. \qquad (21)$$

The Fisher information of a function $f > 0$ is defined by 

$$J(f) := \mathcal{E}(f, \log f).$$

The above equality (21) becomes 

$$\int f \log f\, d\mu = \int_0^\infty J(P_t f)\, dt$$

and so, in view of the commutation between $T_i$ and $L$, which ensures that 

$$P_t(f_i) = (P_t f)_i,$$

we see that the entropy inequality (20) may be derived from its analogue for the Fisher information. 

The next result shows that such an inequality for Fisher information can indeed be derived directly from 
the BL-condition in our abstract setting. In view of the previous discussion, this therefore provides a 
different route for proving Brascamp-Lieb inequalities. 

Theorem 24 (Superadditivity of Fisher information). Assume that $L$ is a Markov generator on $E$ which 
commutes with the maps $T_i$ and that the BL-condition (3) holds. Then, for every probability density $f$ 
on $E$ with respect to $\mu$, under the preceding notation, 

$$\sum_{i=1}^m c_i\, J(f_i) \le J(f). \qquad (22)$$

Before proving this result in full generality, let us note that in the case where L is a diffusion, this 
theorem can be derived easily, following ideas from [5]. Indeed, when L is a diffusion we have 

Using the definition of the conditional density (19) and the chain rule formula for $L$ we see that, for each 
$i \le m$, 

$$J(f_i) = -\int f_i\, L(\log f_i)\, d\mu = -\int f\, L(\log f_i)\, d\mu = \int \frac{\Gamma(f, f_i)}{f_i}\, d\mu.$$

Using the Cauchy-Schwarz inequality and (19) again we get 

J(/l) -JJfW dfi J -JT d » = l 7W*i — d »' 

which means that 

j{fi)< mM^d»- 



/r(/0 

We arrive at (22) after noticing that the condition can be expressed in dual form as 

r(f). 

A similar strategy however does not work in the non-diffusion case. We present below a new method 
that allows us to treat the general case of a Markov generator. It relies on the following observation 
which is of independent interest. 

Lemma 25. Assume $L$ is a Markov generator symmetric for $\mu$. Then for functions $f > 0$ and $H$ of 
arbitrary sign on $E$ we have 

$$\mathcal{E}(f, H) \le \mathcal{E}(f, \log f) + \int f e^{-H} L(e^H)\, d\mu. \qquad (23)$$

In other words we have the following dual formulation of Fisher information: 

$$J(f) = \sup_H \Big\{ \mathcal{E}(f, H) - \int f e^{-H} L(e^H)\, d\mu \Big\}.$$



Proof. We introduce the operator $P := L + \mathrm{Id}$ which is, as $L$, symmetric on $L^2(\mu)$. Replacing $L$ by 
$P - \mathrm{Id}$ we see that the inequality (23) to be proven rewrites as 

$$\int f[H - PH]\, d\mu \le \int f \log f\, d\mu - \int (Pf) \log f\, d\mu - \int f\, d\mu + \int f e^{-H} P(e^H)\, d\mu. \qquad (24)$$

By symmetry, the left-hand side is equal to $\int [P(fH) - H\, Pf]\, d\mu$. By Young's inequality $ab \le a \log a - 
a + e^b$, $a > 0$, $b \in \mathbb{R}$, we get that for every $\lambda > 0$, 

$$P(fH) = \lambda P\Big(\frac{f}{\lambda} H\Big) \le P(f \log f) - (Pf) \log \lambda - Pf + \lambda P(e^H).$$

Hence, choosing $\lambda = f e^{-H}$, 

$$P(fH) - H\, Pf \le P(f \log f) - (Pf) \log f - Pf + f e^{-H} P(e^H).$$

The desired inequality (24) follows after integration, since for every $g$ we have $\int Pg\, d\mu = \int g\, d\mu$. □ 
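Both ingredients of this proof are easy to probe numerically on a finite state space. The Python sketch below evaluates the gap in Young's inequality and the gap between the two sides of (23) for $L = K - \mathrm{Id}$, with $\mu$ uniform; the three-point symmetric kernel is a hypothetical example and the function names are illustrative.

```python
import math

def young_gap(a, b):
    """a*log(a) - a + exp(b) - a*b, which Young's inequality asserts is >= 0 for a > 0."""
    return a * math.log(a) - a + math.exp(b) - a * b

def apply_kernel(K, g):
    """(Kg)(x) = sum_y K(x, y) g(y)."""
    return [sum(K[x][y] * g[y] for y in range(len(g))) for x in range(len(g))]

def dirichlet_form(K, f, g):
    """E(f, g) = -int f Lg dmu for L = K - Id and mu uniform on {0, ..., N-1}."""
    Kg = apply_kernel(K, g)
    return -sum(f[x] * (Kg[x] - g[x]) for x in range(len(f))) / len(f)

def lemma_gap(K, f, H):
    """Right-hand side minus left-hand side of (23)."""
    N = len(f)
    logf = [math.log(v) for v in f]
    eH = [math.exp(h) for h in H]
    KeH = apply_kernel(K, eH)
    # int f e^{-H} L(e^H) dmu with L = K - Id
    last = sum(f[x] * (KeH[x] - eH[x]) / eH[x] for x in range(N)) / N
    return dirichlet_form(K, f, logf) + last - dirichlet_form(K, f, H)

# Hypothetical symmetric kernel on three points (the uniform measure is reversible).
K = [[0.5, 0.25, 0.25], [0.25, 0.5, 0.25], [0.25, 0.25, 0.5]]
f = [1.0, 2.0, 3.0]
print(lemma_gap(K, f, [0.3, -1.0, 2.0]))
print(lemma_gap(K, f, [math.log(v) for v in f]))
```

The second call takes $H = \log f$, the choice at which the supremum in the dual formulation of $J(f)$ is attained, so the gap vanishes up to rounding.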

With the previous lemma in hand, we can easily complete the proof of the theorem. 

Proof of Theorem 24. Note that the conditional expectation property yields, for every $i = 1, \dots, m$, 

$$J(f_i) = \mathcal{E}(f_i, \log f_i) = -\int f_i\, L(\log f_i)\, d\mu = -\int f\, L(\log f_i)\, d\mu = \mathcal{E}(f, \log f_i). \qquad (25)$$

Hence 

$$\sum_{i=1}^m c_i\, J(f_i) = \sum_{i=1}^m c_i\, \mathcal{E}(f, \log f_i) = \mathcal{E}(f, H)$$

where $H = \sum_{i=1}^m c_i \log f_i$. Combining Lemma 25 and the BL-condition (3) (written for $F_i = \log f_i$ which is 
a function of $T_i$) we get 

$$\begin{aligned}
\mathcal{E}(f, H) &\le \mathcal{E}(f, \log f) + \int f e^{-H} L(e^H)\, d\mu \\
&\le J(f) + \int f \sum_i c_i \frac{L(f_i)}{f_i}\, d\mu \\
&= J(f) + \sum_i c_i \int L(f_i)\, d\mu = J(f),
\end{aligned}$$

where we have used in the last step that $L(f_i)/f_i$ is a function of $T_i$ and the conditional expectation 
property (19). □ 

Superadditive inequalities for Fisher information were considered on the sphere $S^{n-1} \subset \mathbb{R}^n$ in [5] 
in the case of $T_i = P_{E_i}$ for subspaces $E_i \subset \mathbb{R}^n$ satisfying $\sum_i c_i P_{E_i} \le \mathrm{Id}_{\mathbb{R}^n}$. As explained 
in §3.2.2, the BL-condition (3) is verified for $d_i = c_i/2$ and we recover by the previous proposition the 
inequality from [5]. 

In the discrete case, some examples of superadditive inequalities for Fisher information were implicitly 
obtained in the papers [TBI HTj . The goal of these papers is to prove modified log-Sobolev inequalities 
of the form 

$$\forall f : E \to \mathbb{R}^+ \text{ with } \int f\, d\mu = 1, \qquad \rho \int f \log f\, d\mu \le \mathcal{E}(f, \log f).$$

As pointed out to us by Eric Carlen, one can extract from their proofs (which are by induction) 
superadditive inequalities for Fisher information which constitute a central technical ingredient. The main 
examples considered in these papers are the symmetric group and slices of the discrete cube. There, 
the marginals are considered with respect to maps Tj which belong to the family studied in the previous 
section, for which we have proved that BL-condition ([3]) holds, and for which we therefore have the 
desired superadditive inequalities. 